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Abstract — Analytical methods for finding moments of random 
Vandermonde matrices with entries on the unit circle are devel- 
oped. Vandermonde Matrices play an important role in signal 
processing and wireless applications such as direction of arrival 
estimation, precoding, and sparse sampling theory, just to name 
a few. Within this framework, we extend classical freeness results 
on random matrices with independent, identically distributed 
(i.i.d.) entries and show that Vandermonde structured matrices 
can be treated in the same vein with different tools. We focus 
on various types of matrices, such as Vandermonde matrices 
with and without uniform phase distributions, as well as gener- 
alized Vandermonde matrices. In each case, we provide explicit 
expressions of the moments of the associated Gram matrix, 
as well as more advanced models involving the Vandermonde 
matrix. Comparisons with classical i.i.d. random matrix theory 
are provided, and deconvolution results are discussed. We review 
some applications of the results to the fields of signal processing 
and wireless communications. 

Index Terms — Vandermonde matrices. Random Matrices, de- 
convolution, limiting eigenvalue distribution, MIMO. 



I. Introduction 

Vandermonde matrices have for a long time had a central 
position in signal processing due to their connections with 
important tools in the field such as the FFT |1| or Hadamard 
(m transforms, to name a few. Vandermonde matrices occur 
frequently in many applications, such as finance |3|, signal ar- 
ray processing |4|, [5|, |6|, Q, ||8l, ARMA processes [9|, cog- 
nitive radio [10], security fTT]. wireless communications ifTSl . 
and biology |13|, and have been much studied. The applied 
research has been somewhat tempered by the fact that very 
few theoretical results have been available. 

A Vandermonde matrix with entries on the unit circle has 
the following form; 



/ 1 



V = 



1 



V 



-0{N-l)u>t 



) 



(1) 



We will consider the case where uji,...,u!l are i.i.d., taking 
values in [0, 27r). Throughout the paper, the oji will be 
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called phase distributions. V will be used only to denote 
Vandermonde matrices with a given phase distribution, and 
the dimensions of the Vandermonde matrices will always be 
N X L. 

Known results on Vandermonde matrices are related to 
the distribution of the determinant lfT4l . The large major- 
ity of known results on the eigenvalues of the associated 
Gram matrix concern Gaussian matrices L15i or matrices with 
independent entries. Very few results are available in the 
literature on matrices whose structure is strongly related to the 
Vandermonde case |,16J , [17 \. Known results depend heavily 
on the distribution of the entries, and do not give any hint 
on the asymptotic behaviour as the matrices become large. 
In the realm of wireless channel modeling, |18| has provided 
some insight on the behaviour of the eigenvalues of random 
Vandermonde matrices for a specific case, without any formal 
proof. 

In many applications, N and L are quite large, and we may 
be interested in studying the case where both go to cx) at a 
given ratio, ^ c. Results in the literature say very little on 
the asymptotic behaviour of ^ under this growth condition. 
The results, however, are well known for other models. The 
factor as well as the assumption that the Vandermonde 
entries e^^'^' lie on the unit circle, are included in ([TJ to ensure 
that the analysis will give limiting asymptotic behaviour 
Without this assumption, the problem at hand is more involved, 
since the rows of the Vandermonde matrix with the highest 
powers would dominate in the calculations of the moments for 
large matrices, and also grow faster to infinity than the 
factor in ([U, making asymptotic analysis difficult. In general, 
often the moments, not the moments of the determinants, are 
the quantities we seek. Results in the literature say also very 
little on the moments of Vandermonde matrices (however, 
see [!l6j), and also on the mixed moments of Vandermonde 
matrices and matrices independent from them. This is in 
contrast to Gaussian matrices, where exact expressions [19| 
and their asymptotic behaviour ||20| are known through the 
concept of freeness ll20l . which is central for describing the 
mixed moments. 

The framework and results presented in this paper are 
reminiscent of similar results concerning i.i.d. random matrices 
121] which have shed light on the design of many impor- 
tant wireless communication problems such as CDMA ll22ll . 
MIMO 1 23 1, or OFDM l24|. This contribution aims to do 
the same. We will show that, asymptotically, the moments 
of the Vandermonde matrices depend only on the ratio c 
and the phase distribution, and have explicit expressions. The 
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expressions are more involved than what was claimed in IfTSl . 
Moments are useful for performing deconvolution. Decon- 
volution for our purposes will mean retrieving "moments" 
tri((D(7V))*), ...,trL((D(iV))^) (where D(iV) are unknown 
matrices), from "mixed moments" of D(A^) and matrices 
on the form ([T]i, with L and N large. We are only able 
to perform such deconvolution when the D(A^) are square 
diagonal matrices independent from V. We will see that 
such deconvolution can be very useful in many applications, 
since the retrieved moments can give useful information about 
the system under study. Deconvolution has previously been 
handled in cases where V is replaced with a Gaussian ma- 
trix dsn, |l26l, im, ||27l. As will be seen, the way the phase 
distribution influences these moments can be split into several 
cases. Uniform phase distribution plays a central role in that 
it minimizes the moments. When the phase distribution has a 
"nice" density (for our purposes this means that the density 
of the phase distribution is continuous), a nice connection 
with the moments for uniform phase distribution can be given. 
When the density of the phase distribution has singularities, 
for instance when it has point masses, it turns out that the 
asymptotics of the moments change drastically. 

We will also extend our results to generalized Vandermonde 
matrices, i.e. matrices where the columns do not consist of 
uniformly distributed powers. Such matrices are important 
for applications to finance fS\. The tools used for standard 
Vandermonde matrices in this paper will allow us to find 
the asymptotic behaviour of many generalized Vandermonde 
matrices as well. 

While we provide the full computation of lower order 
moments, we also describe how the higher order moments 
can be computed. Tedious evaluation of many integrals is 
needed for this. It turns out that the first three limit moments 
coincide with those of the Marchenko Pastur law ll20l . ll28l . 
For higher order moments this is not the case, although we 
state an interesting inequality involving the Vandermonde limit 
moments and the moments of the classical Poisson distribution 
and the Marchenko Pastur law. 

The paper is organized as follows. Section provides 
background essentials on random matrix theory needed to state 
the main results. Section |III] states the main results of the 
paper. It starts with a general result for the mixed moments 
of Vandermonde matrices and matrices independent from 
them. Results for the uniform phase distribution are stated 
next, both for the asymptotic moments, and the lower order 
moments. After this, the nice connection between uniform 
phase distribution and other phase distributions is stated. The 
case where the density of lo has singularities is then handled. 
The section ends with results on generalized Vandermonde 
matrices, and mixed moments of (more than one) independent 
Vandermonde matrices. Section |IV] discusses our results and 
puts them in a general deconvolution perspective, comparing 
with other deconvolution results, such as those for Gaussian 
deconvolution. Section|V]presents some simulations and useful 
applications showing the implications of the presented results 
in various applied fields, and discusses the validity of the 
asymptotic claims in the finite regime. First we apply the 
presented Vandermonde deconvolution framework to wireless 



systems, where we estimate the number of paths, the transmis- 
sions powers of the users, the number of sources, and what 
is commonly referred to as wavelength. Finally we apply the 
results on Vandermonde matrices to the very active field of 
sparse signal reconstruction. 

II. Random matrix background essentials 

In the following, upper (lower boldface) symbols will be 
used for matrices (column vectors), whereas lower symbols 
will represent scalar values, {.)'^ will denote transpose opera- 
tor, (.)* conjugation, and (.)^ — ((•)"'") hermitian transpose, 
li will represent the L x L identity matrix. We let Tr be the 
(non-normalized) trace for square matrices, defined by. 



TriA)=J2' 



where an are the diagonal elements of the L x L matrix A. 
We also let tr^ be the normalized trace, defined by tri(A) — 
iTr{A). 

Results in random matrix theory often refer to the empirical 
eigenvalue distribution of matrices: 

Definition 1: With the empirical eigenvalue distribution of 
an L X L hermitian random matrix T we mean the (random) 
function 

#{*|A,;<A} 



L 



(2) 



where are the (random) eigenvalues of T. 

In the following, DriN),! < r < n will denote non- 
random diagonal L x L matrices, where we implicitly assume 
that ^ c. We will assume that the Dr(A^) have a joint 
limit distribution as TV ^ cx) in the following sense: 

Definition 2: We will say that the {Dr(A^)}i<r<n have a 
joint limit distribution as ^ oo if the limit 



lim trL{^D^,{N)■ 

N^oo 



(3) 



exists for all choices of ii, ig G {1, .., n}. 

The matrices Di(iV) are assumed to be non-random 
throughout the paper However, all presented formulas extend 
naturally to the case when (N) are random and independent 
from the Vandermonde matrices. The difference when the 
Di{N) are random is that expectations of products of traces 
also come into play, in the sense that, similarly to expressions 
on the form ([3]l, expressions of the form 



lim 



E[ trLCD^.iN)- 



■D..(iV))- 



also enter the picture. Our framework can also be extended 
naturally to compute the covariance of traces, defined in the 
following way: 

Definition 3: By the covariance Cij of two traces tri(A') 
and trL{A^) of an L x L random matrix A, we mean the 
quantity 



C,,,(A) = E[trL{A^)trL{A')] 

-E [tvL (A^)] E [trL (AJ)] 



(4) 
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When A is replaced with an ensemble of matrices, A^, 
the limits limi^oo -^Ci.j(Ai) are also called second order 
moments. 

The normalizing factor L is included in order to obtain 
a limit. It will be explained later why this is the cor- 
rect normalizing factor for the matrices we consider. The 
term second order moment is taken from |29|, where dif- 
ferent matrix ensembles were considered. For these ma- 
trices, the second order moments were instead defined as 
limi^oo i^Cj J (Al) (i.e. a higher order normalizing factor 
was used), since these matrices displayed faster convergence 
to a limit. We will present expressions for the second order 
moments limAr^oo iCij(D(A^)V^V). 

Most theorems in this paper will present expressions for 
various mixed moments, defined in the following way: 

Definition 4: By a mixed moment we mean the limit 

M„ = liiHAT^oo E[trL{ T>^{N)W"YTi2[N)Y"Y 
••• xD„(iV)V^V)], 



(5) 



whenever this exists. 

A joint limit distribution of {Dr(A^)}i<r<n is always 
assumed in the presented results on mixed moments. Note that 
when 'Di{N) = • • • = D„(A^) = II, the M„ compute to the 
asymptotic moments of the Vandermonde matrices themselves, 
defined by 



lim E 

N^oc 

lim E 



}rL (( 



Similarly, when Di(iV) = • • • = D„(A^) = D(A^), we will 
also write 

Dn^ lim irL(D(7V)"). (6) 

Note that this is in conflict with the notation Di^^,,,^i^, but the 
name of the index will resolve such conflicts. 

To prove the results of this paper, the random matrix 
concepts presented up to now need to be extended using 
concepts from partition theory. We denote by J'(n) the set of 
all partitions of {1, n}, and use p as notation for a partition 
in 7(71). Also, we will write p — {Wi, ...,Wk}, where Wj 
will be used repeatedly to denote the blocks of p, \p\ ^ k will 
denote the number of blocks in p, and | Wj \ will denote the 
number of elements in a given block. Definition |2] can now be 
extended as follows. 

Definition 5: For p — {Wi,...,Wk}, with Wi = 
{wii, ...,Wiiw,\}, we define 



= A, 
fc 



Dp = 1[Dw,- 



(7) 
(8) 



i=l 



To better understand the presented expressions for mixed 
moments, the notion of free cumulants will be helpful. They 
are defined in terms of noncrossing partitions lf30l . 

Definition 6: A partition p is called noncrossing if, when- 
ever we have i < j < k < I with i ^ k, j ^ I meaning 
belonging to the same block), we also have i ^ j ^ k ^ I 



(i.e. i, j, fc, I are all in the same block). The set of noncrossing 
partitions of {1, , , ., n} is denoted NG{n). 

The noncrossing partitions have already shown their use- 
fulness in expressing what is called the freeness relation in a 
particularly nice way fSOl. 

Definition 7: Assume that Ai,...,A„ are L x i-random 
matrices. By the free cumulants of Ai, A„ we mean the 
unique set of multilinear functionals if > 1) which satisfy 



E [trL (A,, • • • A, 



for all choices of «i, , 



E 

peNC{n) 



(9) 



where 



Kp[Aij, Ai,J = J]^ Kw-JAj^, Aj„] 
i=i 

KwJAij, Ai„] K|VK,|[Ai^^^,...,Ai^^|^^^J, 

where p = {Wi, ...,Wk}, with Wi = {wa, ...,WiiWi\}- By the 
classical cumulants of Ai, A„ we mean the unique set of 
multilinear functionals which satisfy (|9]l with NC (n) replaced 
by the set of all partitions CP(n). 

We have restricted our definition of cumulants to random 
matrices, although their general definition is in terms of more 
general probability spaces (Lecture 11 of ||30ll ). (|9]l is also 
called the (free or classical) moment-cumulant formula. The 
importance of the free moment-cumulant formula comes from 
the fact that, had we replaced Vandermonde matrices with 
Gaussian matrices, it could help us perform deconvolution. 
For this, the cumulants of the Gaussian matrices are needed, 
which asymptotically have a very nice form. For Vandermonde 
matrices, it is not known what a useful definition of cumulants 
would be. However, from the calculations in Appendix |A] it 
will turn out that the following quantities are helpful. 

Definition 8: For p E Tin), define 



K. 



7Vn + l-|p| ^ 

i N 

c n" 

J(0,27r)IPl llfc=l i_^3{^b(k-l)-'^b(k)) 

duji ■ ■■dujip\, 



(10) 



where tuwi , ■■■^^w^p^ are i.i.d. (indexed by the blocks of p), all 
with the same distribution as uj, and where b{k) is the block 
of p which contains fc (notation is cyclic, i.e. 6(0) = b{n)). If 
the limit 

Kp^^ = lim Kp^^^N 

N^oo 

exists, then it is called a Vandermonde mixed moment expan- 
sion coefficient. 

These quantities do not behave exactly as cumulants, but 
rather as weights which tell us how a partition in the moment 
formula we present should be weighted. In this respect our 
formulas for the moments are different from classical or 
free moment-cumulant formulas, since these do not perform 
this weighting. The limits Kp^^ may not always exist, and 
necessary and sufficient conditions for their existence seem 
to be hard to find. However, it is easy to prove from their 
definition that they do not exist if the density of oj has 
singularities (for instance when the density has point masses). 
On the other hand. Theorem [3] will show that they exist when 
the same density is continuous. 
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J'(n) is equipped with the refinement order < ||30l . i.e. 
Pi < P2 if and only if any block of pi is contained within a 
block of p2. The partition with n blocks, denoted 0„, is the 
smallest partition within this order, while the partition with 1 
block, denoted 1„, is the largest partition within this order. In 
the following sections, we will encounter the complementation 
map of Kreweras (p. 147 of |f30l ), which is an order-reversing 
isomorphism of NC{n) onto itself. To define this we need the 
circular representation of a partition: We mark n equidistant 
points 1, ...,71 (numbered clockwise) on the circle, and form 
the convex hull of points lying in the same block of the 
partition. This gives us a number of convex sets Hi, equally 
many as there are blocks in the partition, which do not intersect 
if and only if the partition is noncrossing. Put names 1, ...,fi 
on the midpoints of the 1, (so that i is the midpoint of 
the segment from i to i + 1). The complement of the set UiiJi 
is again a union of disjoint convex sets Hi. 

Definition 9: The Kreweras complement of p, denoted 
K{p), is defined as the partition on {1, ...,n} determined by 



' j in K{p) 



, j belong to the same convex set Hk 



An important property of the Kreweras complement is that 
(p. 148 of 1301 ^ 



\Kip)\ 



1. 



(11) 



III. Statement of main results 



We first state the main result of the paper, which applies to 
Vandermonde matrices with any phase distribution. It restricts 
to the case when the expansion coefficients Kp,^ exist. Differ- 
ent versions of it adapted to different Vandermonde matrices 
will be stated in succeeding sections. 

Theorem 1: Assume that the {D,.(Af)}i<r<n have a joint 
limit distribution as iV ^ oo. Assume also that all Vander- 
monde mixed moment expansion coefficients Kp^^ exist. Then 
the limit 



M„ = lim 



N- 



also exists when 



.E[trL{ Di(7V)V^VD2(iV)V^V 
•••X D„(7V)V«V)] 

c, and equals 

^ Kp.^^c^P^-^Dp. 
pey(n) 



(12) 



(13) 



The proof of Theorem [T] can be found in Appendix |A] 
Theorem [T] explains how "convolution" with Vandermonde 
matrices can be performed, and also provides us with an 
extension of the concept of free convolution to Vandermonde 
matrices. It also gives us means for performing deconvolution. 
Indeed, suppose Di(A^) = • • • = D„(iV) = D(7V), and that 
one knows all the moments M„. One can then infer on the 
moments £)„ by inspecting (fTsT i for increasing values of n. 
For instance, the first two equations can also be written 



A, = 



Du = 



Ml 

M2- 



02 



where we have used (O, that the one-block partition Oi = li is 
the only partition of length 1, and that the two-block partition 
O2 and the one-block partition I2 are the only partitions of 
length 2. This gives us the first moments Di and D2 defined 
by since Di^ Di, Dq^ = Dl, and Di^ = D2. 



A. Uniformly distributed lo 

For the case of Vandermonde matrices with uniform phase 
distribution, it turns out that the noncrossing partitions play a 
central role. The role is somewhat different than the relation 
for freeness. Let u denote the uniform distribution on [0, 2tt). 

Proposition 1: The Vandermonde mixed moment expansion 
coefficient 



K. 



lim Kn 



exists for all p. Moreover, < Kp ^ < 1, the Kp^u are rational 
numbers for all p, and Kp u = 1 if and only if p is noncrossing. 

The proof of Proposition [T| can be found in Appendix |B] 
The same result is proved in lfT6l . where the Kp^u are given 
an equivalent description. The proof in the appendix only 
translates the result in |16| to the current notation. Due to 
Proposition [T] Theorem [T] guarantees that the mixed moments 
(fT2l) exist in the limit for the uniform phase distribution, and 
are given by dTsl) . The ifp,« are in general hard to compute for 
higher order p with crossings. It turns out that the following 
computations suffice to obtain the 7 first moments. 

Proposition 2: The following holds: 



^{{l,3},{2,4}},« 
^^{{1,4},{2.5},{3,6}},« 
^{{1,4},{2.6},{3,5}},« 
^{{1,3,5},{2,4,6}},« 
'f^{{l,5},{3,7},{2,4,6}},« 
'f'^{{l,6},{2,4},{3,5,7}},M 



3 
1 

2 
1 

2 
11 

20 
9 

20 
9 

20' 



The proof of Proposition |2] is given in Appendix |C] Com- 
bining Proposition [T] and Proposition |2] one can prove the 
following: 

Proposition 3: Assume Di{N) = ■ ■ ■ = D„(7V) = T){N), 
and that the limits 



(cM)„ = c lim E tTL ( (D(iV)V^V) " ) ^14) 



K 



{cD), 



c lim irL(D"(iV)). 



(15) 
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exist. When w = u, we have that 



TO2 
TO3 

TO4 
TO5 

TO6 
TO7 



d2+dl 

ds + 3^2 di + dl 

d4 + 44(^1 + -dj + 6rf2rf? + df 
o 

25 

d,.; + bdidi + —^3^2 + 10^3^1 ' 

O 



^5 

40 



d^di + 10^2^? + dl 



de + 6^5 dl + 12^4^2 + 15^4^1 + 
^dl + 50d3d2di + 20d3dl + 

lldl + AOdldj + 15rf2di +di 

49 o 
dr + Idedi + -^^5^2 + 21d5df + 

497 

— (^4^3 + 84^4(^2^1 + 35^4^1 + 



1057 



djdi 



693 
U 



^3^2 + 175^3^2^? 



280 

35d3dt + 77d^di + —dldl + 
21d2dl + dl. 

Proposition [3] is proved in Appendix |Dl Several of the 
following theorems will also be stated in terms of the scaled 
moments (fT4l)-(fT5]), rather than Af„,Z?„. The reason for this 
is that the dependency on the matrix aspect ratio c can be 
absorbed in to„, d„, so that the result itself can be expressed 
independently of c, as in the equations of Proposition [3] The 
same usage of scaled moments has been applied for large 
Wishart matrices IZTl . Similar computations to those in the 
proof of Proposition [3] are performed in [16|, although the 
computations there do not go up as high as the first seven 
mixed moments. To compute higher order moments, Kp ^ 
must be computed for partitions of higher order also. The 
computations performed in Appendix ICl andlDjshould convince 
the reader that this can be done, but that it is very tedious. 

Following the proof of Proposition [1] we can also obtain 
formulas for the second order moments of Vandermonde 
matrices. Since it is easily seen that Ci.„(D(A^)V^V) = 
C„4(D(7V)V^V) = 0, the first nontrivial second order 
moment is the following: 

Proposition 4: Assume that V has uniform phase distribu- 
tion, let dn be as in (fTSl l, and define 



nii 



Then we have that 



lim LQ^j (D(iV)V^V)) 



L — >oo 



+ 4:d:idi-dl + 4:d2di. 
3 



(16) 



(17) 



Proposition |4] is proved in Appendix |El and relies on the 
same type of calculations as those in Appendix ICl Following 
the proof of Proposition [T] again, we can also obtain exact 
expressions for moments of lower order random Vandermonde 
matrices with uniform phase distribution, not only the limit. 
We state these only for the first four moments. 



Theorem 2: Assume Di(7V) = D2(A^) 



set c 



and define 



cE 



tTL ( (D(Ar)V^V) 



4^^^) = ctrL(D"(iV)). 
When a; = w we have that 



D„(7V), 

(18) 
(19) 



'•2 



(NX) 



+ {df''^'^r 



2N- 



{NX)JNX) 



12iV" 



19 



N 



-3 



12N-' 
- 6N^^ 



8N 
10 

y 



I (df 



-6(l-iV- 



,(Af,L), ,(^',L)^2 



Theorem |2] is proved in Appendix |F] Exact formulas for the 
higher order moments also exist, but they become increasingly 
complex, as higher order terms N^^ also enter the picture. 
These formulas are also harder to prove for higher order 
moments. In many cases, exact expressions are not what we 
need: first order approximations (i.e. expressions where only 
the A^~^-terms are included) can suffice for many purposes. 
In Appendix IF] we explain how the simpler case of these first 
order approximations can be computed. It seems much harder 
to prove a similar result when the phase distribution is not 
uniform. 

An important result building on the results we present is the 
following, which provides a major difference from the limit 
eigenvalue distributions of Gaussian matrices. 

Proposition 5: The asymptotic mean eigenvalue distribu- 
tion of a Vandermonde matrix with uniform phase distribution 
has unbounded support. 

Proposition |5] is proved in Appendix |G] 

B. Lo with continuous density 

The following result tells us that the limit Kp ^^ exists for 
many w, and also gives a useful expression for them in terms 
of Kp,u and the density of u. 

Theorem 3: The Vandermonde mixed moment expansion 
coefficients Kp,^^ — limAr^oo Kp^^^^N exist whenever the 
density p^^ of uj is continuous on [0, 27r). If this is fulfilled, 
then 

r2iT 



K — K 



(27r 



ilp|-i 



Pi 



,{x)^P^dx 



(20) 



The proof is given in Appendix |H] Although the proof 
assumes a continuous density, we remark that it can be 
generalized to cases where the density contains a finite set of 
jump discontinuities also. In Section |V] several examples are 
provided where the integrals ( l20l i are computed. An important 
consequence of Theorem [3] is the following, which gives the 
uniform phase distribution an important role. 
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Proposition 6: Let V^^ denote a Vandermonde matrix with 
phase distribution uj, and set 



equation 



V^,n = lim E 

Af— ►oo 



tTL { (v5vj" 



Then we have that 



The proof is given in Appendix U An immediate conse- 
quence of this and Proposition |5] is that all phase distributions, 
not only uniform phase distribution, give Vandermonde matri- 
ces with unbounded mean eigenvalue distributions in the limit. 
Besides providing us with a deconvolution method for finding 
the mixed moments of the {Dr(Af)}i<r <n, Theorem [3] also 
provides us with a way of inspecting the phase distribution lo, 
by first finding the moments of the density, i.e. p^{x)'^dx. 
However, note that we can not expect to find the density of 
u! itself, only the density of the density of us. This follows 
immediately by noting that p^{x)''dx remains unchanged 
when the phase distribution uj is cyclically shifted. 



C. UJ with density singularities 

The asymptotics of Vandermonde matrices are different 
when the density of w has singularities, and depends on 
the density growth rates near the singular points. It will be 
clear from the following results that one can not perform 
deconvolution for such uj to obtain the higher order moments 
of the {Dr(A^)}i<r-<n, as only their first moment can be 
obtained. The asymptotics are first described for lo with atomic 
density singularities, as this is the simplest case to prove. 
After this, densities with polynomic growth rates near the 
singularities are addressed. 

Theorem 4: Assume that p^ — X^i^iPi'^^i atomic 
(where {x) is dirac measure (point mass) at a^), and denote 
by =Y.\=iP'^- Then 

lim E\Tri Di(7V)— V^VDafA^)— V^V 
...xD„(A^)lv^V)] 



:"-V") lim T]trL{n,iN)) 



Note here that the non-normalized trace is used. 

The proof can be found in Appendix |j] In particular. 
Theorem ID states that the asymptotic moments of -i-V^V can 
be computed from p'"^. The theorem is of great importance 
for the estimation of the point masses pi. In blind seismic 
and telecommunication applications, one would like to detect 
the locations a^. Unfortunately, Theorem |4] tells us that this is 
impossible with our deconvolution framework, since the p^'^\ 
which are the quantities we can find through deconvolution, 
have no dependency to them. This parallels Theorem |3] since 
also there we could not recover the density p^^ itself. Having 
found the p'"' through deconvolution, one can find the point 
masses pi, by solving for pi,p2,--- in the Vandermonde 



/ PI 



P2 

pI 




p 



(2) 



V : : : : / V : / V : 

The case when the density has non-atomic singularities is 
more complicated. We provide only the following result, which 
addresses the case when the density has polynomic growth rate 
near the singularities. 

Theorem 5: Assume that 

lim I a; — ai\^p^(x) = pi for some < s < 1 

X — >ai 

for a set of points ai,. with p^ continuous for w ^ 
ai, ...,ar. Then 

lim E\Tr( T>i(N)—V"VD2(N)—V"V 



■■•xD„(A^)^V«V)] 

n 

c"-ig(") lim T]trL{B,{N)) 



i=l 



where 



(n) _ 



2r(l-s)cos(ii^))"p("); 



/fO.ll" Hk 



-dxi ■ ■ ■ dXn, 



(21) 



and p*^"' = ^^p^^■ Note here that the non-normalized trace is 
used. 

The proof can be found in Appendix [Kl Also in this case 
it is only the point masses pi which can be found through 
deconvolution, not the locations a,. Note that the integral in 
(ISTT i can also be written as an m-fold convolution. Similarly, 
the definition of Kp^^^M given by (fTOl i can also be viewed as 
a 2-fold convolution when p has two blocks, and as a 3-fold 
convolution when p has three blocks (but not for p with more 
than 3 blocks). 

A useful application of Theorem |5] we will return to is the 
case when u ~ k sin(6') for some constant k (see (|39]l), with 9 
uniformly distributed on some interval. This case is simulated 
in Section lV-AI It is apparent from ( l40l i that the density goes to 
infinity near lu ~ ±k, with rate x^^^^. Theorem |5] thus applies 
with s = 1/2. For this case, however, the "edges" at ±7r/2 are 
never reached in practice. Indeed, in array processing |31 1, the 
antenna array is a sector antenna which scans an angle interval 
which never includes the edges. We can therefore restrict u; in 
our analysis to clusters of intervals [oi, fii] not containing ±1, 
for which the results of Section IllI-BI suffice. In this way, we 
also avoid the computation of the cumbersome integral (l2Tl i. 

D. Generalized Vandermonde matrices 

We will consider generalized Vandermonde matrices on the 
form 



V 



1 



-jLJV/(^)J'^i 



-jW(i)Ja;i. 



(22) 
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where / is called the power distribution, and is a function 
from [0, 1) to [0, 1). We will also consider the more general 
case when / is replaced with a random variable A, i.e. 



V = 



-jN\2LJL 



(23) 



with the Xi i.i.d. and distributed as A, defined and taking values 
in [0, 1), and also independent from the ujj. 

We will define mixed moment expansion coefficients for 
generalized Vandermonde matrices also. The difference is that, 
while we in Definition [8] simplified using the geometric sum 
formula, we can not do this now since we do not assume 
uniform power distribution anymore. To define expansion 
coefficients for generalized Vandermonde matrices of the form 
(l22T i. define first integer functions /jv from [0, — 1] to 
[0, TV - 1] by fN{r) ^[Nf Let pf^ be the correspond- 

ing density for f^. The procedure is similar for matrices of 
the form ( |23] ). The following definition captures both cases: 

Definition 10: For (|22]) and ( |23] ). define 

/(0,27r)IPl 

nLi (E.1"o' Pf« (r)e^-'-(-K.-i, -K.) ) 
dwi ■ ■ ■ <hj\p\ 

/(o,2.)ipi nLi (/>e^-^'^(-K'=-)-M.,)dA 
duji ■ ■ -dujipi, 

(24) 

where uJwi, ■■■,t^W; are as in Definition [8] If the hmits 



Kp.,uj.J 
^p.uj.X 



= lim K, 

TV— ►oo 



lim K, 



p.u.X.N ^ 



exist, then they are called Vandermonde mixed moment expan- 
sion coefficients. 

Note that ([T]i corresponds to (|22] | with f{x) — x. The 
following result holds: 

Theorem 6: Theorem [T] holds also with Vandermonde ma- 
trices ([TJ replaced with generalized Vandermonde matrices on 
either form ( |22] | or ( |23] l, and with Kp,^ replaced with either 
Kp^uij or Kp^^^x. 

The proof follows the same lines as those in Appendix lAl 
and is therefore only explained briefly at the end of that 
appendix. As for matrices of the form ([TJ, it is the case 
of uniform phase distribution which is most easily described 
how to compute for generalized Vandermonde matrices also. 
Appendix |B] shows how the computation of boils down 
to computing certain integrals. The same comments are valid 
for matrices of the form ( |22] | or ( |23]) in order to compute 
Kp^uij and Kp,uj.\. This is further commented at the end of 
that appendix. 

We will not consider generalized Vandermonde matrices 
with density singularities. 



E. The joint distribution of independent Vandermonde matri- 
ces 

When many independent random Vandermonde matrices are 
involved, the following holds: 

Theorem 7: Assume that the {Dr(Ar)}i<r<n have a joint 
limit distribution as ^ oo. Assume also that Vi,V2,... 
are independent Vandermonde matrices with the same phase 
distribution uj, and that the density of uj is continuous. Then 
the limit 

lim^^oo E\tri,{ Di(7V)V,ffV„D2(iV)V,f V,3 
•••xD„(7V)V«V,J] 



also exists when 



c. The limit is when n is odd, and 



equals 



p<crea'(Ti) 



p^uj^ ^ p-) 



(25) 



where a = {0^,02) = {{1, 3, 5, }, {2, 4, 6, ...}} is the 
partition where the two blocks are the even numbers, and the 
odd numbers. 

The proof of Theorem [T] can be found in Appendix |L] That 
appendix also contains some remarks on the case when the 
matrices D, (A^) are placed at different positions relative to 
the Vandermonde matrices. From Theorem |2l the following 
corollary is immediate: 

Corollary 1: The first three mixed moments 

= lini E \tTL ((Vf V2Vf Vi) 
of independent Vandermonde matrices Vi , V2 are given by 



(2) 



(2) 



(2) 



I2 

I,. 

20 



2/34 
^4/3 



h 

h9h 



6/5 



where 



2tt 



Pi^{x)''dx 



In particular, when the phase distribution is uniform, the first 
three mixed moments are given by 



(2) 



(2) 



(2) 



1 

11 

y 

411 
'20 



The results here can also be extended to the case with 
independent Vandermonde matrices with different phase dis- 
tributions: 

Theorem 8: Assume that {^i}i<i<s are independent Van- 
dermonde matrices, where V,; has continuous phase distribu- 
tion uji. Denote by p^^. the density of w,;. Then Equation (l25T l 
still holds, with Kp^^ replaced by 

Xp,„(27r)l''l-i r f[p^^{xyP^^dx, 

where pi consists of all numbers k such that ik = i- 

The proof is omitted, as it is a straightforward extension of 
the proofs of Theorem |3] and Theorem |71 
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IV. Discussion 

In the recent work |16|, the Vandermonde model ([T]l is 
encountered in reconstruction of multidimensional signals in 
wireless sensor networks. The authors also recognize a similar 
expression for the Vandermonde mixed moment expansion 
coefficient as in Definition [8] They also state that, for the case 
of uniform phase distribution, closed form expressions for the 
moments can be found, building on an analysis of partitions 
and calculation of volumes of convex polytopes described by 
certain constraints. This is very similar to what is done in this 
paper 

We will in the following discuss some differences and 
similarities between Gaussian and Vandermonde matrices. 

A. Convergence rates 

In fT9l, almost sure convergence of Gaussian matrices was 
shown by proving exact formulas for the distribution of lower 
order Gaussian matrices. These deviated from their limits by 
terms of order In Theorem^ we see that terms of order 

are involved. This slower rate of convergence may not be 
enough to make a statement on whether we have almost sure 
convergence for Vandermonde matrices. However, [32| shows 
some almost sure convergence properties for certain Hankel 
and Toeplitz matrices. These matrices are seen in that paper to 
have similar combinatorial descriptions for the moments, when 
compared to Vandermonde matrices in this paper. Therefore, it 
may be the case that the techniques in |[32l can be generalized 
to address almost sure convergence of Vandermonde matrices 
also. Figure [U shows the speed of convergence of the moments 
of Vandermonde matrices (with uniform phase distribution) 
towards the asymptotic moments as the matrix dimensions 
grow, and as the number of samples grow. The differences 
between the asymptotic moments and the exact moments are 
also shown. To be more precise, the MSB values in Figure [1] 
are computed as follows: 

1) K samples are independently generated using 

2) The 4 first sample moments Vji = j^trn ( (V^^Vi)"' 



+ MSE between exact atid asytnptotic moments 
o MSE between estimated and exact moments 



(1 < j < 4) are computed from the samples. 

3) The 4 first estimated moments Vj are computed as the 
mean of the sample moments, i.e. Vj = ^ ''^ji- 

4) The 4 first exact moments Ej are computed using 
Theorem |2l 

5) The 4 first asymptotic moments Aj are computed using 
Proposition [3] 

6) The mean squared error (MSE) of the first 4 esti- 
mated moments from the exact moments is computed 

as E-=i {vj~E,y. 

7) The MSE of the first 4 exact moments from the asymp- 
totic moments is computed as X]j=i i-^j ^ ^i)^ ■ 

Figure [T] is in sharp contrast with Gaussian matrices, as shown 
in Figure |2] First of all, it is seen that the asymptotic moments 
can be used just as well instead of the exact moments (for 
which expressions can be found in |33|), due to the 0(1/7V^) 
convergence of the moments. Secondly, it is seen that only 
5 samples were needed to get a reliable estimate for the 
moments. 



++++- 



O oqO 



(a) 80 samples 



MSE between exact and asymptotic moments 
MSE between estimated and exact moments 



cP O 
) O °0 i Cfa qjo ° ^ 



(b) 320 samples 

Fig. 1 . MSE of the first 4 estimated moments from the exact moments for 80 
and 320 samples for varying matrix sizes, with N = L. Matrices are on the 
form V^V with V a Vandermonde matrix with uniform phase distribution. 
The MSE of the first 4 exact moments from the asymptotic moments is also 
shown. 



B. Inequalities between moments of Vandermonde matrices 
and moments of known distributions 

We will state an inequality involving the moments of Van- 
dermonde matrices, and the moments of known distributions. 
The classical Poisson distribution with rate A and jump size 
a is defined as the limit of 



1 _ _ Uo + -(5„ 
n I n 



as oo IBOI . where * denotes classical (additive) convo- 
lution, and *Ti denotes n-fold convolution with itself. For our 
analysis, we will only need the classical Poisson distribution 
with rate c and jump size 1, denoted Vc- The free Poisson 
distribution with rate A and jump size a is defined similarly 
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+ MSE between exact and asymptotic moments 
o MSE between estimated and exact moments 



9l i 9 i i d) i W i i g i i V l 'rto i Vm i ffi i cb i i (D 



200 
N 



Fig. 2. MSE of the first 4 estimated moments from the exact moments for 
5 samples for varying matrix sizes, with N = L. Matrices are on the form 
-^XX^ with X a complex standard Gaussian matrix. The MSE of the first 
4 exact moments from the asymptotic moments is also shown. 



as the limit of 

1 - - Uo + -<5a 

n J n 

as n ^ oo, where ffl is the free probability counterpart of 
* ||30l . II20I . and where ffln denotes n-fold free convolution 
with itself. For our analysis, we will only need the free Poisson 
distribution with rate - and jump size c, denoted jic- jJ-c is the 
same as the better known Marchenko Pastur law, i.e. it has the 
density ll20l 



1, 



r=(a;) = (l--)+5o(x) + 



(26) 



2ttcx 

where (2:)+ = max(0, z), a = (1 — -y/c)^, b = (1 + ^)^. Since 
the classical (free) cumulants of the classical (free) Poisson 
distribution are Aa" 1, 30] , we see that the (classical) cumulants 
of i^c are c, c, c, c, and that the (free) cumulants of fic are 
1, c, c^, c^, .... In other words, if ai has the distribution /ic, 
then 



</.(a?) 



peNC{n) 
peJVC(n) 



E 



peJVC(n) 



(27) 

Here we have used the Kreweras complementation map and 
(fTTT i. with ip denoting the expectation in a non-commutative 
probability space ll20i . Also, if a2 has the distribution Vc, then 

M 



E 



(28) 



We immediately recognize the c''*! ^ -entry of Theorem [T] in 
(IZTT i and (l28T l (with an additional power of c in (l28Tl). Combin- 
ing Proposition □ with Di(A^) = • • • = D„(A^) = II, 
and (|28l l, we thus get the following corollary to Proposition [T] 
Corollary 2: Assume that V has uniform phase distribu- 
tion. Then the limit moment 



Vn = lim E 

N^oc 



satisfies the inequaUty 

where oi has the distribution /ic of the Marchenko Pastur law, 
and 02 has the Poisson distribution z/j,. In particular, equality 
occurs for m — 1,2,3 and c = 1 (since all partitions are 
noncrossing for m = 1,2, 3). 

Corollary |2] thus states that the moments of Vandermonde 
matrices with uniform phase distribution are bounded above 
and below by the moments of the classical and free Poisson 
distributions, respectively. The left part of the inequality in 
Corollary |2] was also observed in Section VI in [T6l. The dif- 
ferent Poisson distributions enter here because their (free and 
classical) cumulants resemble the cl''l~^-entry in Theorem [1] 
where we also can use that Kp ^ = 1 if and only if p is 
noncrossing to get a connection with the Marchenko Pastur 
law. To see how close the asymptotic Vandermonde moments 
are to these upper and lower bounds, the following corollary 
to Proposition [3] contains the first moments: 

Corollary 3: When c = 1, the limit moments 



Vn 



lim E 



tr. 



V»V) 



the moments /p„ of the Marchenko Pastur law /ii, and the 
moments p„ of the Poisson distribution vi satisfy 



fpA = 14 
/P5 = 42 
/P6 = 132 
fP7 = 429 



< 
< 
< 
< 



^4 = f 



§71 



14.67 
i 48.67 
w 178.55 
w 713.67 



< 
< 
< 
< 



Pi = 15 
P5 = 52 
Pq = 203 
P7 = 877. 



The first three moments coincide for the three distributions, 
and are 1,2, and 5, respectively. 

The numbers /p„ and p„ are simply the number of 
partitions in NC{n) and T(ri), respectively. The number 
of partitions in NC{n) equals the Catalan number C„ = 
II30I . and are easily computed. The number of 
partitions of J'(n) are also known as the Bell numbers i?„ ll30l . 
They can easily be computed from the recurrence relation 



Br, 



fc=0 ^ ^ 



In Figure [3] the mean eigenvalue distribution of 640 samples 
of a 1600 X 1200 (i.e. c = 0.75) Vandermonde matrix 
with uniform phase distribution is shown. While the Poisson 
distribution vi is purely atomic and has masses at 0, 1, 2, and 
3 which are e^^, e^^, e^^/2, and e^^/6 (the atoms consist 
of all integer multiples), the Vandermonde histogram shows a 
more continuous eigenvalue distribution, with the peaks which 
the Poisson distribution has at integer multiples clearly visible, 
although not as sharp. We remark that the support of V^V 
for a fixed N goes all the way up to N , but lies within 
[0, A^]. It is unknown whether the peaks at integer multiples 
in the Vandermonde histogram grow to infinity as we let 
A^ ^ 00. From the histogram, only the peak at seems to 
be of atomic nature. The effect of decreasing c amounts to 
stretching the eigenvalue density vertically, and compressing 
it horizontally, just as the case for the different Marchenko 
Pastur laws. An eigenvalue histogram for Gaussian matrices 
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Fig. 3. Histogram of the mean eigenvalue distribution of 640 samples of 
V^V, with V a 1600 x 1200 Vandermonde matrix with uniform phase 
distribution. 




0123456789 10 



Fig. 4. Histogram of the mean eigenvalue distribution of 20 samples of 
iXX^. with X an L X Af = 1200 X 1600 complex, standard, Gaussian 
matrix. 



which in the Hmit give the corresponding (in the sense of 
Corollary |2]i Marchenko Pastur law for Figure [3] (i.e. /io.75) is 
shown in Figure |4] Figure |5] shows an eigenvalue histogram 
in the case of a non-uniform phase distribution. Here we have 
taken 640 samples of a 1600 x 1200 Vandermonde matrix with 
phase distribution with density (|40] |. with A = 2d, a ^ j. This 
density, also shown in Figure |6] is used in the applications of 
Section IV-AI Experiments show that the eigenvalue histogram 
tends to flatten when the phase distribution becomes "less 
uniform", with a higher concentration of larger eigenvalues. 

It is unknown whether the inequalities for the moments can 
be extended to inequalities for the associated capacity. If X is 
an X iV standard, complex, Gaussian matrix, then an explicit 



10 



1 

0.9 
0.8 
0.7 
0.6 




0123456789 10 



Fig. 5. Histogram of the mean eigenvalue distribution of 640 samples of 
V^V, with V a 1600 X 1200 Vandermonde matrix with phase distribution 
defined in (40) with X = 2d,a = j. 

0.5 p , 1 , , , rn 

0.45 - 

0.4 - 
0.35 - 

0.3 - 




0.15- 
0.1 - 
0.05 - 

\ — I 1 1 1 1 — \ ^ 

-3-2-10123 

0) 

Fig. 6. The density Pcj(a') given by )40t . with A = 2d, a = ^. 

expression for the asymptotic capacity exists 128)1 : 

limw^oo jj log2 det (l^v + p (^XX^)) = 

21og2(l + p-i(V4^-l)') (29) 

-^(^/4^-l)^ 

In Figure Ilia), several realizations of the capacity are com- 
puted for Gaussian matrix samples of size 36 x 36. The 
asymptotic capacity ( l29t is also shown. In Figure |3b), several 
realizations of the capacity are computed for Vandermonde 
matrix samples of the same size, for the case of uniform phase 
distribution. It is seen that the variance of the Vandermonde 
capacities is higher than for the Gaussian counterparts. This 
should come as no surprise, due to the slower convergence 
to the asymptotic limits for Vandermonde matrices. Although 
the capacities of Vandermonde matrices with uniform phase 
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- Asymptotic capacity 
sampie capacity 



matrix) is 



(a) Realizations of log2 det (Ijv + p-^XX^) when X is standard, 
complex, Gaussian. The asymptotic capacity )29t is also shown. 



M.ii' 



... :l. .:■':!. 



(b) Realizations of log2 dot (Ijv + p VV^ ) when uj has uniform phase 
distribution. 

Fig. 7. Realizations of the capacity for Gaussian and Vandermonde matrices 
of size 36 X 36. 



distribution and Gaussian matrices seem to be close, we have 
no proof that the capacities of Vandermonde matrices are even 
finite due to the unboundedness of its support. 



C. Deconvolution 



Deconvolution with Vandermonde matrices (as stated in 
( fTsT l in Theorem [Til differs from the Gaussian deconvolution 
counterpart |30 | in the sense that there is no multiplicative |[30l 
structure involved, since Kp^ is not multiplicative in p. The 
Gaussian equivalent of Proposition[3](i e. V^V replaced with 
-i-XX^, with X an L X iV complex, standard, Gaussian 



mi 
m2 
ma 
m4 
m5 

me 



my 



d2 + d\ 
dz + 3(^2^1 
c?4 + Ad^di 
c?5 + bdidi 



dl 

2dl + 6^2 



5d3d2 + lOdadl 



dt 



lOdldi + 10^2^? + dl 

C?6 + 6c?5C?i + 6c?4C?2 + 15(^4^1 ~ 

3^3 + 30d3rf2di + 20rf3d? + 
5dl + lMld\ + lbd2d\ + dl 
dr + Idcdi + ld^d2 + 21^5^1 - 
Tdidz + A:2did2di + ZM^dl + 
2ldldi + 2ld'idl + 105^3^2^? 
35(^3^^ + 35d^di + IQdldl + 
21^2^1 + dl, 



(30) 
(31) 
(32) 
(33) 

(34) 



(35) 



(36) 



where the m^ and the di are computed as in (fT4l) -(fT5]). This fol- 
lows immediately from asymptotic freeness |20|, and from the 
fact that -^XX^ converges to the Marchenko Pastur law fic- 
In particular, when all Di(A^) = II and c = 1, we obtain the 
limit moments 1,2,5, 14, 42, 132, 429, which also were listed 
in Corollary [3] One can also write down Gaussian equivalents 
to the second order moments of Vandermonde matrices ( fTTb 
using techniques from |29|. However the formulas look quite 
different, and the asymptotic behaviour is different. We have 
for instance 



lim L''Di,i 



D(iV)lxX^ 



= cd2 



(37) 



where it is not needed that the matrices T){N) are diagonal. 
Similarly, one can write down an equivalent to Theorem |2] for 
the exact moments. For the first three moments (the fourth 
moment is dropped, since this is more involved), these are 



mi 
m2 
m3 



dl 
d2 
(1 



dl 



N-^) d3 + 3did2 + dl. 



This follows from a careful count of all possibilities after the 
matrices have been multiplied together (see also li33l , where 
one can see that the restriction that the matrices Di(A^) are 
diagonal can be dropped in the Gaussian case). It is seen, 
contrary to Theorem |2] for Vandermonde matrices, that the 
second exact moment equals the second asymptotic moment 
(I3TI 1. and also that the convergence is faster (i.e. 0{N~'^)) 
for the third moment (this will also be the case for higher 
moments). 

The two types of (de)convolution also differ in how they 
can be computed in practice. In lIZTl . an algorithm for free 
convolution with the Marchenko Pastur law was sketched. A 
similar algorithm may not exist for Vandermonde convolu- 
tion. However, Vandermonde convolution can be subject to 
numerical approximation: To see this, note first that Theorem|3] 
splits the numerics into two parts: The approximation of the 
integrals J p^{xyp^dx, and the approximation of the iiTp „. A 
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Strategy for obtaining the latter quantities could be to randomly 
generate many numbers between and 1 and estimate the 
volume as the ratio of the solutions which satisfy ( |63] l in 
Appendix |B] Implementations of the various Vandermonde 
convolution variants given in this paper can be found in 1341 . 

In practice, one often has a random matrix model where 
independent Gaussian and Vandermonde matrices are both 
present. In such cases, it is possible to combine the individual 
results for both of them. In Section [V] examples on how this 
can be done are presented. 

V. Applications 

The applications presented here all use the deconvolution 
framework for Vandermonde matrices. Since additive, white, 
Gaussian noise also is taken into account, Vandermonde de- 
convolution is combined with Gaussian deconvolution. Matlab 
code for running the different simulations can be found in |34||. 

In the eigenvalue histograms for Vandermonde matrices 
shown in figures [3] and |5j large matrices were used in order 
to obtain something close to the asymptotic limit. In practical 
scenarios, and in the applications we present, N and L are 
much smaller than what was used in these figures, which 
partially explains the uncertainty in some of the simulations. In 
particular, the uncertainty for non-uniform phase distributions 
such as those in Section IV-AI is high, since exact expressions 
for the lower order moments are not known, contrary to the 
case of uniform phase distribution. In all the following, d is the 
distance between the antennas whereas A is the wavelength. 
The ratio j is a figure of the resolution with which the system 
will be able to separate (and therefore estimate the position 
of) users in space. 

A. Detection of the number of sources 

Let us consider a basestation equipped with N receiving 
antennas, and with L mobiles (each with a single antenna) in 
the cell. The received signal at the base station is given by 



VP5s, +ni 



(38) 



Here is the x 1 received vector, s, is the L x 1 transmit 
vector by the L users which is assumed to satisfy E [s^s^] — 
1l, rii is A^ X 1 additive, white, Gaussian noise of variance 
(all components in and are assumed independent). 
In the case of a line of sight between the users and the base 
station, and considering a Uniform Linear Array (ULA), the 
matrix V has the following form: 



/ 1 

g-i27r4sin(ei) 



V 



... 1 N 

... g-j27r4sin(ei) 



y g-i27r(Ar-l)fsin(ei) ... g-j27rf sin(ei ) J 



(39) 

Here, 6i is the angle of the user in the cell and is supposed 
to be uniformly distributed over [—a, a]. is an L x i 
diagonal power matrix due to the different distances from 
which the users emit. In other words, we assume that the 
phase distribution has the form 27r^ sin(6') with 9 uniformly 
distributed on The fact that the phase has the form 



27rj sin(6') is a well known result in array processing ||3T| . 
The user's distribution can be known (in the case of these 
simulations, the uniform distribution has been accounted for 
without loss of generality) through measurements in wireless 
systems up to some parameters (here, a typically). This is 
usually done to have a better understanding of the user's 
behaviour. It is easily seen, by taking inverse functions, that 
the density is, when 21™!" < 



1 



2a 



iiT^d? _ ^2 
A2 



(40) 



], and elsewhere (see Figure |6]). 



Throughout the paper we will assume, as in Figure |5] 
that A ~ 2d, a — j when model ( [39] l is used. With this 
assumption, ^'^ ^"^ " < 1 is always fulfilled. 

The goal is to detect the number of sources L and their 
respective power based on the sample covariance matrix 
supposing that we have K observations, of the same order 
as A^. When the number of observation is quite higher than A^ 
(and the noise variance is known), classical subspace methods 
|f35l provide tools to detect the number of sources. Indeed, let 
R be the true covariance matrix given by 



VPV 



H 



where is the noise variance. This matrix has N — L 
eigenvalues equal to and L eigenvalues strictly superior 
to (7^. One can therefore determine the number of source 
by counting the number of eigenvalues different from a^. 
However, in practice, one has only access to the sample 
covariance matrix given by 

1 



W = — YY^, 
K 



with 



Y= [ri,...rK] - VP^ 



[ni, ...,nK\ 



(41) 



If one has only the sample covariance matrix W, we have three 
independent parts which must be dealt with in order to get an 
estimate of P: the Gaussian matrices S = [si,...,s/f] and 
N = [rii, ....uk], and the Vandermonde matrix V. It should 
thus be possible to combine Gaussian deconvolution ||33]| 
and Vandermonde deconvolution by performing the following 
steps: 

1) Estimate the moments of -^VP^SS^P^V-^ using 
multiplicative free convolution as described in fj?]. This 
is the denoising part. 

2) Estimate the moments of PV^V, again using multi- 
plicative free deconvolution. 

3) Estimate the moments of P using Vandermonde decon- 
volution as described in this paper. 

Putting these steps together, we will prove the following: 
Proposition 7: Define 



p^i^xY'dx, 



(42) 
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and denote the moments of P and the sample covariance 
matrix, respectively, by 



Then the equations 

Wi = C2P1 + 

W2 = C2P2 + {clh + C2C3){Plf 

W3 = C2P3 + i3cp2 + Sc2C3)PiP2 

+ {clh+3clc3l2+C2cl){Pif 
+3(t2(1+Ci)c2P2 

+3a\{l + ciylh + C3ic3 + 2c2)){Pif 
+3ct4(c? + 3ci + l)c2Pi 
+a^(c?+3ci + l) 



provide an asymptotically unbiased estimator for the mo- 
ments Pi from the moments of Wi (or vice versa) when 
limAT^oo = ci, liniAT^oc- ^ = C2, limAr^oc- ^ = C3. 

The proof of this can be found in Appendix [Ml Note that 
C3 = C1C2, so that the definition of C3 is really not necessary. 
We still include it however, since ci, C2 and C3 are matrix 
aspect ratios which represent different deconvolution stages, 
so that they all are used when these stages are implemented 
and combined serially. In the simulations. Proposition [T] is put 
to the test when P has three sets of powers, 0.5, 1, and 1.5, 
with equal probability, with phase distribution given by ( [39] l. 
Both the number of sources and the powers are estimated. 
For the phase distribution ( |39] |. the integrals I2 and can 
be computed exactly (for general phase distributions they are 
computed numerically), and are 



h = 



h = 



X , ^l + sm a 
■ In 



Ada^ \1 — sm a 
\^ tan a 



Under the assumptions A = 2d, a ~ j used throughout this 
paper, the integrals above take the values 



h = 



h = 



40 



In 



V2, 



1600 



For estimation of the powers, knowing that we have only 
three sets of powers with equal probability, it suffices to 
estimate the three lowest moments in order to get an estimate 
of the powers (which are the three distinct eigenvalues of P). 
Therefore, in the following simulations. Proposition |7] is first 
used to get an estimate of the moments of P. Then these are 
used to obtain an estimate of the three distinct eigenvalues of 
P using the Newton-Girard formulas |37|. These should then 
lie close to the three powers of P. Power estimation for the 
model ( [39] l is shown in the first plot of Figure [TO] In the plot, 
K — L — N = 144, and a ~ \/0.1. Experiments show that 
when the phase distribution becomes "less" uniform, larger 



matrix sizes are needed in order for accurate power estimation 
using this method. This will also be seen when we perform 
power estimation using uniform phase distribution in the next 
section. 

For estimation of the number of users L, we assume that 
the power distribution of P is known, but not L itself. Since 
L is unknown, in the simulations we enter different candidate 
values of it into the following procedure: 

1) Computing the moments Pi = iri(P*) of P. 

2) The moments Itl^P^) are fed into the formulas of 
Proposition |7] and we thus obtain candidate moments 
Wi of the sample covariance matrix W. 

3) Compute the sum of the square errors between these can- 
didate moments, and the moments Wi of the observed 
sample covariance matrix W, i.e. compute Y^i=i \ ~ 

The estimate L for the number of users is chosen as the one 
which gives the minimum value for the sum of square errors 
after these steps. 

In Figure ID we have set a = \/(U, N = 100, and L = 36. 
We tried the procedure described above for 1 all the way up 
to 100 observations. It is seen that only a small number of 
observations are needed in order to get an accurate estimate 
of L. When K — 1, \t is seen that more observations are 
needed to get an accurate estimate of L, when compared to 
K = 10. 

B. Estimation of the number of paths 

In many channel modeling applications, one needs to deter- 
mine the number of paths in the channel ll38l . For this purpose, 
consider a multi-path channel of the form: 



hij) = ^ Si5(T - Ti 



Here, Si are i.d. Gaussian random variables with power Pi and 
Ti are uniformly distributed delays over [0, T]. The Si represent 
the attenuation factors due to the different reflections. L is the 
total number of paths. In the frequency domain, the channel 
is given by 



L 

H{f) = V s,G(/)e-J2-^-' 



Sampling the continuous frequency signal at fi — where 
W is the bandwidth, the model becomes (for a given channel 
realization) 

H = VP^s 



where 



V 



/ 1 



V 



= -i27r-: 



-j2Tv{N-l)^ 



-i2.(7V-l)^ 



(43) 

We will here set = T = 1, which means that the Wj of 
([T]i are uniformly distributed over [0, 27r). The corresponding 
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(R) K = l 
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Actual value of L 



Number of observations 



(b) K = 10 



Fig. 8, 



Estimate for tlie number of usens. Actual value of L is 36. Also, cr = 
N = 100. The powers were 0.5, 1, and 1.5, with equal probability. 



ci = f , C2 = and C3 = ^. Tlien 



N 



K- 



E[Wi] 
E[W2\ 



- C2P1 



N 



P2+C2{c2+Cz){Pif 



-2(72(c2 + C3)Pi+a4(l + Ci) 



E[W3] = 



C2 ( 1 + ;^ 



1 



1 

X2 



1 - ^ ) ( 3C^ ( 1 + ) + 3C2C3 ) P1P2 



C2(l + ;^j+3c^C3+C2C^j 
-3.^((1+C,)C2 + ||) (l-l)P2 



-3ct 



2 / Cicf 



KL 



3C2C3 (Pi) 



-3^4 c? 



3ci + 1 + ) C2P1 



3ci 



1 



1 



Just as Proposition |7] ttiis is proved in Appendix |M] In ttie 
following, this result is used in order to determine the number 
of paths as well as the power of each path. The different 
convergence rates of the approximations are clearly seen in 
the plots. 

In Figure |9] the number of paths is estimated based on the 
procedure sketched above. We have set a — VO.l, N = 100, 
and L — 36. The procedure is tried for 1 all the way up to 
100 observations. The plot is very similar to Figure [8] in that 
only a small number of observations are needed in order to 
get an accurate estimate of L. When = 1, it is seen that 
more observations are needed to get an accurate estimate of 
L, when compared to K = 10. 

For the estimation of powers simulation, we have set K = 
N = L = 144, and a = \/0.1, following the procedure also 
described above, up to 1000 observations. The second plot in 
Figure [TO] shows the results which confirms the usefulness of 
the approach. 



eigenvalue histogram was shown in Figure [3] When additive 
noise (n) again is taken into consideration, our model again 
becomes that of ( [38] l, the only difference being that the phase 
distribution of the Vandermonde matrix now is uniform. L now 
is the number of paths, N the number of frequency samples, 
and P is the unknown LxL diagonal power matrix. Taking K 
observations we arrive at the same form as in dTO . In this case 
with uniform phase distribution, we can do even better than 
Proposition |7] in that one can write down estimators for the 
moments which are unbiased for any number of observations 
and frequency samples: 

Proposition 8: Assume that V has uniform phase distribu- 
tion, and let Pi be the moments of P, and Wi = trjv(W*) 
the moments of the sample covariance matrix. Define also 



C. Estimation of wavelength 

In the field of MIMO cognitive sensing 1391, 1401, terminals 
must decide on the band on which to transmit and in particular 
sense which band is occupied. One way of doing so is to 
find the wavelength A in ( [39] l, based on some realizations of 
the sample covariance matrix. In our simulation, we have set 
K = 10, L = 36, N = 100, and a = ^/0J, in addition to 
A = 2, d = l,a — J. We have tried values between and 
5 as candidate wavelengths (to be more precise, the values 
0.05, 0.1, 0.15, 5 are tried), and chosen the one which gives 
the smallest deviation (in the same sense as above, i.e. the sum 
of the squared errors of the first three moments are taken) 
from a different number of realizations of sample covariance 
matrices. The resulting plot is shown in Figure [TTl and shows 
that the Vandermonde deconvolution method can also be used 
for wavelength estimation. 
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Fig. 9. 



Estimate for 
X = 100. 



(b) X = 10 

the number of paths. Actual value of L is 36. Also, 



+ First eigenvalue 
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^+ , 1 — , , , , 
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(b) The model g3) of Section IV-B] 

Fig. 10. Estimation of powers for the two models (39) and (43), for various 
number of observations. K = N = L = 144, and a = VO.l. The actual 
powers were 0.5, 1, and 1.5, with equal probability. 



D. Signal reconstruction and estimation of the sampling dis- 
tribution 

For signal reconstruction, one can provide a general frame- 
work where only the sampling distribution matters asymptot- 
ically. The sampling distribution can be estimated with the 
help of the presented results. Several works have investigated 
how irregular sampling affects the performance of signal 
reconstruction in the presence of noise in different fields, 
namely sensor networks BTl . Il42l . image processing ||431 . 
||441. geophysics [451, and compressive sampling p^. The 
usual Nyquist Theorem states that for a signal with maximum 
frequency /max, one needs to sample the signal at a rate 
which is at least twice this number. However, in many cases, 
this can not be performed, or one has an observation of a 
signal at only a subset of the frequencies. Moreover, one feels 
that if the signal has a sparse spectrum, one can take fewer 
samples and still have the same information on the original 
signal. One of the central motivations of sparse sampling is 



exactly to understand under which condition one can still 
have less samples and recover the original signal up to an 
error of e |47|. Let us consider the signal of interest as a 
superposition of its frequency components (this is also the 
case for a unidimensional bandlimited physical signal), i.e. 

and suppose that the signal is sampled at various instants 
[ii, ii] with ti E [0, 1]. This can be identically written as 

N-l 

= 7^ E ^^^"^'^ 

^ k=0 

or r = V^s. In the presence of noise, one can write 

r = V^s + n, (44) 
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Fig. 1 1 . Estimation of wavelength. Deconvolution was performed for varying 
number of observations, assuming different wavelengtlis. In tlie true model 
{39j, \ = 2,d = 1, a = K = 10, L = 36, N = 100, and a = Vol. 



Fig. 12. Estimated values of a using j46j-(48), for various number of 
observations, and for K = 10, L = 36, N = 100, a = VoT- The actual 
value of a was ^. 



where r = [r{uji), ...r{uji^)]'^ , s and n are as in ( l38T l, and with 
V on the form ([T]i. A similar analysis for such cases can be 
found in lfT6l . 

In the following, we suppose that one has K observations 
of the received sampled vector r: 



[ni, ...,nK\ 



(45) 



The vector r is the discrete output of the sampled continuous 
signal r{w) for which the distribution is unknown (however, 
c is known). This case happens when one has an observation 
without the knowledge of the sampling rate for example. The 
difference in ( l45l l from the model (l4Tl i lies in that the adjoint 
of a Vandermonde matrix is used, and in that there is no 
additional diagonal matrix P included. The following result 
can now be stated and proved similarly to Proposition|2]and|8] 

Proposition 9: 



' O o qCO 
On 

(Too qO 



Estimated 1^ 

- Actuai ig 
Estimated 1^ 

- Actual I, 



OOD c 
<>o o < 



l + l + I ^K+-H-++^ + +++ l + l ++ I ^ I ^H^-^ + 1 l + l f l + l ^^+ l l -^ l I I+I 4f+ I + 1 1 1 f+-^ l -H-+f-ffl ^- 



300 

Number of observations 



Fig. 13. Estimated values of I2 and ^3 using )46t - j48t . 
of observations, and for K = 10, L = 36, N = 100, cr 
value of a was ^. 



for various number 
= VO.l. The actual 



E [ir„ (W)] 
E [tr,, (W^)] 
E [trn (W3)] 



1 + (46) 

C2/2 + (l+C3)(l+a2)2 (47) 
l+3c2(l+C3)/2 

3c3 + 4 + clh 

3a^{l + 3C3 +4+ C2(l + C3)/2) 



3ct'^C2(c^ + 3c3 + 1) 
a6(c2+3c3 + l), 



where lim 



JV- 



N_ 
K 



L_ 

+00 ^ 



(48) 



C3, 



ci, limAT^oc- 7v = C2, limAT 
/„ is defined as in Proposition |7] and W = 

The proof of Proposition |9] is commented in Appendix |M] 
We have tested (|46]l-(|48]l by taking a phase distribution ui 
which is uniform on [0, a], and elsewhere. The density is 
thus ^ on [0, a], and elsewhere. In this case we can compute 



that 



h 



2tt 



The first of these equations, combined with (l46]l-(l48Tl. enables 
us to estimate a. This is tested in Figure [12] for various 
number of observations. In Figure [13] we have also tested 
estimation of I2 , /3 from the observations using the same 
equations. When one has a distribution which is not uniform, 
the integrals 13,14,... would also be needed in finding the 
characteristics of the underlying phase distribution. Figure [13] 
shows that the estimation of I2 requires far fewer observa- 
tions than the estimation of I3. In both figures, the values 
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K = 10,L = 36,N = 100, and a = VOJ were used and a 
was J. It is seen that the estimation of is a bit off even for 
higher number of observations. This is to be expected, since 
an asymptotic resuh is applied. 



VI. Conclusion and further directions 

We have shown how asymptotic moments of random Van- 
dermonde matrices with entries on the unit circle can be 
computed analytically, and treated many different cases. Van- 
dermonde matrices with uniform phase distribution proved to 
be the easiest case, and it was shown how the case with 
more general phases could be expressed in terms of this. The 
case where the phase distribution has singularities was also 
handled, as this case displayed different asymptotic behaviour 
Also, mixed moments of independent Vandermonde matrices 
were investigated, as well as the moments of generalized 
Vandermonde matrices. In addition to the general asymptotic 
expressions stated, exact expressions for the first moments of 
Vandermonde matrices with uniform phase distribution were 
also stated. We have also provided some useful applications of 
random Vandermonde matrices. The applications concentrated 
on deconvolution and signal sampling analysis. As shown, 
many useful system models use independent Vandermonde 
matrices and Gaussian matrices combined in some way. The 
presented examples show how random Vandermonde matrices 
in such systems can be handled in practice to obtain estimates 
on quantities such as the number of paths in channel modeling, 
the transmission powers of the users in wireless transmission, 
or the sampling distribution for signal recovery. The paper 
has only touched upon a limited number of applications, but 
the results already provide benchmark figures in the non- 
asymptotic regime. 

From a theoretical perspective, it would also be interesting 
to find methods for obtaining the generalized expansion co- 
efficients Kp^uj,\ from Kp^u.u, similar to how we found the 
expansion coefficients Kp ^^ from Kp u- This could also shed 
some light on whether uniform phase- and power distribution 
also minimizes moments of generalized Vandermonde matri- 
ces, similarly to how we showed that it minimizes moments 
in the non-generalized case. 

Throughout the paper, we assumed that only diagonal 
matrices were involved in mixed moments of Vandermonde 
matrices. The case of non-diagonal matrices is harder, and 
should be addressed in future research. The analysis of the 
maximum and minimum eigenvalue is also of importance. 
The methods presented in this paper can not be used directly 
to obtain explicit expressions for the p.d.f. of the asymptotic 
mean eigenvalue distribution, so this is also a case for fu- 
ture research. A way of attacking this problem could be to 
develop for Vandermonde matrices analytic counterparts to 
what one has in free probability, such as the R-, S-, and the 
Stieltjes transform |20|. Interestingly, certain matrices similar 
to Vandermonde matrices, have analytical expressions for the 
moments: in flT], analytical expressions for the moments of 
matrices with entries of the form Aij — F{uJi — ujj) are found. 
This is interesting for the Vandermonde matrices we consider. 



since 



^^^^ 



sin(f(, 



.)) 



iVsin (^(t^j - L^j)) 



Unfortunately, the function F]y{x) — jy^^ i depends on 
the matrix dimension N, so that we can not find a function F 
which fits the result from |17|. 

Finally, another case for future research is the asymptotic 
behaviour of Vandermonde matrices when the matrix entries 
lie outside the unit circle. 



Appendix A 
The proof of Theorem[I] 

We can write 

E [tTL (Di(7V)V^VD2(iV)V^V • • • D„(iV)V^V)] (49) 



D2(iV)(j2,j2)V^02,i3)Vfe,j3) 



D„(A^)0„,j„)V^(j„,zi)V(*i,ji)) 

(50) 

The (ji, ...,j„) uniquely identifies a partition p of {1, 
where each block Wj of p consists of the positions of the 
indices which equal j, i.e. 

- {k\jk = j}. 
We will also say that (ji, j„) give rise to p. Write 

When (ji, ...jjn) give rise to p, we see that since 

Jwji = Jwj2 — ■ • • — Jwjin'jl ' 

we also have that 



UJi = LUi = ■ ■ ■ = UJ 



^^i\W. I ' 



and we will denote their common value by low- as in Defini- 
tion |8] With this in mind, it is straightforward to verify that 
can be written as 

E E E 

giving rise to p 

xDi(iV)(ji,ji) X ... X D„(iV)(j„,j„), (51) 
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where zi, i„ takes values between and iV — 1. We will in 
the following switch between the form ( ISTI i and the form 

E E E 

giving rise to p 

_/yIp|-"-i^Ip|-i 2^-IpI 




\k=l 

xDi(7V)(ji,ji) X ••• X D„(iV)(j„,j„),(52) 

where we also have reorganized the powers of N and L in 
( fSTT l. and changed the order of summation (i.e. summed over 
the different ii, first). Noting that 

/ „ \ 

= N\p\-'^-^e\ ^ JJ^ e-''*^"''(''-i'"'^''<'''^*'= (54) 
\(M,...,»„)fc=i / 

= ^|phn-l£; j -Q j ^ gi(^6(;c-l)-a>Kfc))ife j j (55) 



\k=l \ik=0 



= N 



\p\-n- 



(56) 



fe=i 



r " 1 _ gj^'('^b(fc-i)-'^b(fc)) 

7(0,27r)IPl 1 - e-''('^''('=-i)-'^H'=)) 

dwi • • • dt^ipi (57) 
= Kp,^,N, (58) 

Definition [8] of the Vandermonde mixed moment expansion 
coefficients comes into play, so that ( |52] i can also be written 

E E 

pe^(") (ji,...,j„) 

giving rise to p 

xDi(7V)(ji,ji)... X xD„(7V)0„,j„).(59) 

The notation for a joint limit distribution simplifies ( |52] i. 
Indeed, add to (|52| i for each p the terms 

E E 

giving rise to p 

xT>i{N){ji,n) • • • X D„(A^)(j„, j„). (60) 
These go to as ^ oo, since they are bounded by 



After this addition, the limit of (|59l l can be written 

J2 c\p\'^Kp^^Dp, (61) 



which is what we had to show. g 
We also need to comment on the statement of Theo- 
rem |6] where generalized Vandermonde matrices are con- 
sidered. In this case, the derivations after ( l52b are different 
since the power distribution is not uniform. For the case 
of (|22l), we can in ^ replace YJ'^^^^e^i^bik-i)-'^Hk))ik 

with ^P/ivC'')^-'''^"''"'""""''"'''', since the number of 

occurrences of the power e-'''('^''(''-i)~'^''<'=)^ is Npf^(r). The 
rest of the proof of Theorem |6] follows by canceling n powers 
of N after this replacement. The details are similar for the 
case ( l23T l. where the law of large numbers is applied to arrive 
at the second formula in 



Appendix B 
The proof of Proposition [U 



Note that for each block W^. 



E 



j{T.ke 



ik-l-J2k 



when 



and 1 if 



keWi 



E = E 

keWj keWj 
If we denote by S'pjv the set of all n-tuples (ii, , 



(62) 



(0< 



ik < N — \, \ < k < n) which solve ( |62] |. and define |5'p,Ar| 
to be the cardinality of 5pjv, it is clear that 



N- 



N- 



It is straightforward to show that the solution set of ( |62] i has 
n + 1 — IpI free variables. After dividing the equations ( l62b by 
N and letting N go to infinity, ifp.u can thus alternatively be 
expressed as the volume in M"+i^IpI of the solution set of 



E 



Xk-l 



keWj 



Xk, 



(63) 



with < Xfe < 1. It is clear that the volume of this solution set 
computes to a rational number It is the form ( |63] | which will be 
used in the other appendices to compute Kp u for certain lower 
order p. Appendix D of lfT6l states the same equations for 
finding quantities equivalent to Vandermonde mixed moment 
expansion coefficients for the uniform phase distribution. The 
fact that Kp^u < 1 follows directly from Appendix D of lfl6l . 
The same applies for the fact that Kp,u = 1 if and only if p 
is noncrossing. 

For any p, we can define a partition of {1, n} into — 
IpI blocks, where two elements are defined to be in the same 
block if and only if the corresponding variables in solutions 
to (l63b are linearly dependent. When p is noncrossing, it is 
straightforward to show that two such variables are dependent 
if and only if they are equal, and also that this partition is the 
Kreweras complement K{p) of p. This fact is used elsewhere 
in this paper 

We will also briefly explain why the computations in this 
appendix are useful for generalized Vandermonde matrices 
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with uniform phase distribution. For ( |22] ). the number of 
solutions ii,...,ik to ( |62] i needs to be muhiplied by 

since each ij now may occur Npf,^ (ij) times. This means that 
Kp.uj.f can be computed as the integrals in this appendix, but 
that we also need to multiply with the density pf for each 
variable. The computations of these new integrals become 
rather involved when / is not uniform, and are therefore 
dropped. 

Appendix C 
The proof for PropositionE] 

We will in the following compute the volume of the solution 
set of (|63]l, as a volume in [0, C R'^+^-IpI, as 

explained in the proof of Proposition [T] These integrals are 
very tedious to compute, and many of the details are skipped. 
The formula 

(r + s+1)! io 

can be used to simplify some of the calculations for higher 
values of n. 



A. Computation of K^^i^^y ^2,i}}.u 

This is equivalent to finding the volume of the solution set 

of 

Xi + X3 — X2 + X4 

in M^. Since this means that 

X4 = xi + X3 ~ X2 lies between and 1, 

we can set up the following integral bounds: When X1+X3 < 
1, we must have that < X2 < xi + X3, so that we get the 
contribution 

/ / dx2dx3dxi, 

IQ Jo Jo 

which computes to i. When 1 < xi + X3, we must have that 
xi + X3 — 1 < 2:2 < 1, so that we get the contribution 

»i .1 .1 

/ / dx2dxzdxi, 

fO J 1 — Xl J Xl-^X3 — 1 

which also computes to ^. Adding the contributions together 
we get |, which is the stated value for -^'{{i, 3}. {2, 4}}. ti- 
lt turns out that when the blocks of p are cyclic shifts of each 
other, the computation of Kp^u can be simplified. Examples 
of such p are {{1, 3}, {2, 4}} (for which we just computed 
ifp,J, {{1,3, 5}, {2, 4, 6}}, and {{1, 4}, {2, 5}, {3, 6}}. We 
will in the following describe this simplified computation. Let 
af"'\x) be the polynomial which gives the volume in R™^^ 
of the solutions set to xi + • • • + Xm = x (constrained to 
< Xi < 1) for Z < a; < ^ + 1. It is clear that these satisfy the 
integral equations 



(x) 



a[":l{t)dt+ / a["'\t)dt, 



which can be used to compute the a™(a;) recursively. Note 
first that a'i^\x) — 1. For to = 2 we have 



a^^\x) = I al^''{t)dt^ 



a?\x) 
For m = 3 we have 

a^o\^) = 



a^^\t)dt — -X 



1 



,(2), 



af\x) = / a^i^\t)dt+ I a\''{t)dt 

Jx-l Jl 

- l-i(x-lf-i(2-xf 



(x) 



a[^\t)dt ^ ^(3-x)2. 



1 



x-l 



(64) 



B. Computation 0/ i^{{i,3,5},{24,6}},« 
For TO = 3, integration gives 

\aff{t)dt+ (\af^f{t)dt+ l\aff{t)dt, 
Jl J2 

which computes to This is the stated expression for 

'^{{l,3,5},{2,4,6}},ti- 

C. Computation 0/ ii:{{i_4} {2,5},{3,6}},u 

This is equivalent to finding the volume of the solution set 

of 

Xi + X4 = X2 + Xs = X3 + X6 

in R^, which is computed as 

\a^^^f{t)dt+ f{a^^^f{t)dt, 
lo Jl 

which computes to i. This is the stated expression for 

^{{1,4},{2,5},{3,6}},U- 

D. Computation of K{{i^4} ^{23}, {3,5}}, u 

This is equivalent to finding the volume of the solution set 

of 

Xi + X4 = X2 + X5 
X2 + Xg = X3 + Xi 

in R*. Since this means that 

X5 = xi — X2 + X4 lies between and 1, 
xe — xi — X2 + X3 lies between and 1, 

we can set up the following integral bounds: 

For X2 > xi we must have X2 — xi < X3,X4 < 1, so that 
we get the contribution 

r-l 1-1 i-l rl 

/ / C?X4C?X3C?X2(iXl, 

/O J Xi J X2 — X1 J X2^Xi 

which computes to j. It is clear that for xi > X2 we get 
the same result by symmetry, so that the total contribution is 
i + i = i, which proves the claim. 
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E. Computation o/Xni gj^ig 7}_{2,4,6}},« 

This is equivalent to finding the volume of the solution set 



of 



in 



\ or 

X7 = 



Xi+ = X2+ Xq 

xz-\- xj = Xi+ xi 



xc, + xi ~ X2 lies between and 1, 
X4 + .Ti — X3 lies between and 1 . 



(65) 



This can be split into the following volumes: 

1) Xi <X2 < X^, 

2) CCi < 0:3 < X2, 

3) X^ <X2 < Xi, 

4) X2 < X'i < Xi, 

5) X2 < Xl < X-i, 

6) X^ < Xl < X2- 

Each of these volumes can be computed by setting up an in- 
tegral with corresponding bounds. Computing these integrals, 
we get the values respectively. Adding 

these contributions together, we get 

4 11 _ 27 _ 9 
15 ^ 60 ^ 60 ~ 20' 
which proves the claim. 

F. The computation o/i4:{{i_6},{2,4},{3,5,7}},u 

This is equivalent to finding the volume of the solution set 

of 

Xi+ Xq = X2+ X-j 
X2 + X4 — X3 + X5 

in R^, or 

xq — XT + X2 — Xl lies between and 1, 
X5 — X4 + X2 — X3 lies between and 1, . 



This can be obtained from ( 165b by a permutation of the 
variables, so the contribution from -ftr{{i.6},{2,4}.{3,5.7}},« must 
also be J^, which proves the claim. 

Appendix D 
The proof for Proposition!!] 

Note first that multiplying both of sides of ( fT3] ) with c gives 



cMn = Kp,^{cD)p, 



(66) 



where we now can substitute the scaled moments (fT4l i-(fT5]l. 
With Di(iV) = D2(iV) = •■• = D„(iV) = Ti[N), Dp 
as defined in Definition |2] does only depend on the block 
cardinalities \Wj\, so that we can group together the Kp_^ 
for p with equal block cardinalities. If we group the blocks of 
p so that their cardinalities are in descending order, and set 



where ri> r2> ■ ■ ■ > r^, and also write 



(67) 



can be written 



(68) 



riH \-rk=n 



For the first 5 moments this becomes 



mi = Kidi 

1712 = K2d2 + Kisd\ 

1713 = K^ds + K2,id2d\ 
m4 = K/idi + K^id'idi 

Ki,is,i4 
1715 = K^d^ + K4^id4di 



Kixidl 



(69) 
(70) 
(71) 



K2,2dl + K2A,ld2dl + 



(72) 



+K3^2d3d2 + 

K3^i,id3dl + K2,2,idldi + K2s,i,id2dl + 
Ki^i,i,i,idl. (73) 

Thus, to prove Proposition |3] we have to compute the 
^ri,r2,...,rfc by goiug through all partitions. We will have use 
for the following result, taken from |30|: 

Lemma 1: The number of noncrossing partitions in NC{n) 
with ri blocks of length 1, r2 blocks of length 2 and so on 
(so that ri + 2r2 + 3r3 + • • • rir„ = n) is 



n! 



ri!r2! • • • r„!(n + 1 — ri 



r2 



■rn)l' 



■J>{n) 



{p = {Wi, Wk} e y{n)\\w,\ = r,yi}, 



Using this and a similar formula for the number of par- 
titions with prescribed block sizes, we obtain cardinalities 
for noncrossing partitions and the set of all partitions with 
a given block structure. These numbers are the used in the 
following calculations. For the proof of Proposition[3] we need 
to compute ( |67] | for all possible block cardinalities (ri, r/c), 
and insert these in (|69]l-(l73]l. The formulas for the three first 
moments are obvious, since all partitions of length < 3 are 
noncrossing. For the remaining computations, the following 
two observations save a lot of work: 

• If pi e y{ni), p2 G ^{712) with rii < 712, and pi can be 
obtained from p2 by omitting elements k in {l,...,n2} 
such that k and k + 1 are in the same block, then we 
must have that Kp^^u = Kp^^u- This is straightforward 
to prove since it follows from the proof of Proposition [T] 
that ik+i can be chosen arbitrarily between and N — 1 
in such a case. 

• Kp^_u = Kp.^ u if the set of equations ( |63]) for pi can 
be obtained by a permutation of the variables in the set 
of equations for p2- Since the rank of the matrix for 
( |63l l equals the number of equations —1, we actually 
need only have that \pi\ — 1 of the \pi\ equations can 
be obtained from permutation of \p2\ — 1 equations of 
the IP2I equations in the equation system for p2- 
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A. The moment of fourth order 

The result is here obvious except for the case for the three 
partitions with block cardinalities (2, 2) (for all other block 
cardinalities, all partitions are noncrossing, so that A'ri,r2....,rfc 
is simply the number of noncrossing partitions with block 
cardinalities (ri,...,rfc). this number can be computed from 
Lemma [T]i. Two of the partitions with blocks of cardinality 
(2, 2) are noncrossing, the third one is not. We see from 
Proposition |2] that the total contribution is 

K2,2 = 2 + Ki 
= 2 



4{1 3},{2,4}},« 

^ 8 

3^3' 



The formula for the fourth moment follows. 

B. The moment of fifth order 

Here two cases require extra attention: 

1) p = {Wi,W2} with \Wi\ = 3, 11^21 = 2.- There are 10 
such partitions, and 5 of them have crossings and contribute 
with A^{{i,3},{2,4}},M- The total contribution is therefore 



5 

5 + 5 X 



5 X -f*-'{{1.3},{2,4}},u 
2 _ 25 



3 3 ■ 

2) p = {Wi,W2,W3} with \Wi\ = IW2I = 2, IW3I = 1.- 

There are 15 such partitions, of which 5 have crossings. The 
total contribution is therefore 

10 + 5 X iir{{i.3}_{2.4}},« 

^ 10 + 5x| = f. 

The computations for the sixth and seventh order moments 
are similar, but the details are skipped. These are more tedious 
in the sense that one has to count the number of partitions with 
a given block structure, and identify each partition with one 
of the coefficients listed in Proposition |2] 

Appendix E 
The proof of Proposition]!] 

Cij (D(Af)V^V) is computed as in Appendix lAl Since 
some terms in E [tr^ (A*) tr^ (A^)] cancel those in 
E [tri^ (a*)] E [ir^ (A^)], we can restrict to summing over 
partitions of 1, 2, i + j where at least one block contains 
elements from both [1, i] and [i + 1, i+j] - We denote this 
set by j), and set n = i + j. In our new calculations. (l52l i 
now instead takes the form 

^ E E E 

giving rise to p 



n 



xDi{N){ji,,n) X ••• X D„(iV)(j„,j„),(74) 

where the normalizing factor L from Definition |4] has been 
included. Simplifying this as in Appendix |A] and restricting 
to uniform phase distribution, we obtain 

lim LC,,j(D(A^)V^V) = V c\P\-^K2,p,uDp, 

P6 3'(i,i) 



where K2 « u is the volume of the solution set of 



(75) 



where a is the permutation which shifts [1, i] and + + 
j] to the right cyclically so that the result is contained within 
the same interval. Thus, when the normalizing factor L is 
included, we see that the second order moments exist. 

6*2,2 (D(^)V'^V) in dnil is computed by noting that 
^2,{{i,3},{2,4}},« and if2,{{i,4},{2,3}},« both equal |, and that 
there are 9 other partitions in 7{2, 2), and K2.tt.u = 1 for all 
these TT (all these values are computed as in Appendix ICb. By 
adding up for the different block cardinalities we get that 



c lim LC2,2(D(iV)V^V) = ^4 + 'idadi-dj + 4^2^?, 

L — ^CJO o 



and using the substitution ( fT6b we arrive at the desired result. 



Appendix F 
The proof of Theorem[2] 



In order to get the exact expressions in Theorem |2l we now 
need to keep track of the Kp_u,N defined by ( fTOb . not only the 
limits Kp u (if we had not assumed u) = u, the calculations 
for Kp^^^M would be much more cumbersome). When p is 
a partition of {1, ...,??} and n < 4, we have that Kp^u,N = 
Kp,u = 1 when p ^ {{1, 3}, {2, 4}}. We also have that 



K 



{{1,3},{2,4}},U,W 



(76) 



where we have used that ^ 



N 



{N+i){N+^) m. 



We also need the exact expression for the quantity 



J2 1"! Di (iV) (ji , ji ) X • • • X D„ {N) (j„ , j„) 



giving rise to p 



from ( |59] l (i.e. we can not add ( |60] | to obtain the approximation 



(|6T] | here). Setting £)„ 



(JV,L) 



Ili=i ^w^'^K we see that 



n(JV,i) 
p 



trL (D"(7V)), and 



J2 L'^'i-i^iTp,, 

p'>p 



(77) 
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which can be used recursively to express the Tp in terms of 
the Dp^'^\ We obtain the following formulas for n = 4: 



-t{{l,2,3,4}} - ^4 



(78) 



^{{1,2,3}, {4}} = £'3 ^1 ^4 



r{{l,2},{3,4}} - (£'2 
J{{1,2},{3},{4}} ^ {U^ ) 



(80) 



-l-r^{NX)j^{N,L) 



3 



J{{1},{2},{3},{4}} = (i'l ) 



(81) 



3L-'D 



(i?^^'^')^ (82) 



For n — 3 and n = 2 the formulas are 

J{{1.2,3}} = ^3 
J{{1,2},{3}} - ^1 ^2 ^ ^ ^3 
J{{1},{2},{3}} = {iJl ) -SL 



-2L-^D 

(NX) 



(83) 
(84) 

(85) 
(86) 



T{{i,2}} - D. 
^{{i},{2}} = (^1 ) - ^ ^2 ■ (87) 

It is clear that (|28]l-(l82]i and (l83]l-(l82]l cover all possibiUties 
when it comes to partition block sizes. Using (fl4]i-(fT5Tl, and 
putting dZDl, (|78ll-(l82li, and (I83]l-(l87li into ^ we get the 
expressions in Theorem |2] after some calculations. 

If we are only interested in first order approximations rather 
than exact expressions, dTTj i gives us 



p' >P 

|p|-|p'l=i 



which is easier to compute. Also, we need only first order 
approximations to Kp u,n, which is much easier to compute 
than the exact expression. For (|76] l, 

2 

K{{U3}.{2A}}.u^N ~ 3 

is already a first order approximation. Inserting the approxima- 
tions in ( |59] l gives a first order approximation of the moments. 

Appendix G 
The proof of Proposition [5] 
We only state the proof for the case c = 1. In ll32l it 
is stated that the asymptotic 2n-moment (1712,1) of certain 
Hankel and Toeplitz matrices can be expressed in terms of 
volumes of solution sets of equations on the form ( |63l ). with p 
restricted to partitions with all blocks of length 2. Rephrased 
in our language of Vandermonde mixed moment expansion 
coefficients, this means that 



TO2J1 



E 



(88) 



p G T(2n) 

p has two elements in each block 



In the language of ||32|| , the formula is not stated exactly 
like this, but rather in terms of volumes of solution sets of 
equations of the form ( |63] |. This translates to (ISST i, since we in 
Appendix iBl interpreted Kp ^ as such volumes. In Proposition 
A.l in lf32l . unbounded support was proved by showing that 
(™2n)^^"' 00. Again denoting the asymptotic moments of 
Vandermonde matrices with uniform phase distribution by Vn, 
we have that m2n < Vzn, since we sum over a greater class 
of partitions than in dSSl ) when computing the Vandermonde 
moments. This means that (V2n)^^" 00 also, so that the 
asymptotic mean eigenvalue distribution of the Vandermonde 
matrices have unbounded support also. 

Appendix H 
The proof of Theorem[3] 
We will use the fact that 



(27r)IPlJV''+i-IPl ^ 

•l-lfe=l i_e^"(="i)(fc-i) -^6(fc)) 



/(0,27r)l 

dxi ■ ■ • dx\p\ , 



(89) 



where integration is w.rt. Lebesgue measure. 

For p = In Theorem [3] is trivial. We will thus assume that 
p ^ In in the following. We first prove that limjv^oo Kp.tu.N 
exists whenever is continuous. To simplify notation, define 

M T 



k=l 



_ pji'^b(k-l)-'^b(k)) 



n 

k=l 



sm 



sm 



((^6(fe-l) - t^fc(i;))/2) 



and set uj = {uji, ...,uj^p^) and du = duji ■ ■ ■ dcu^p^. Since uj is 
continuous, there exists a Pmax such that Puj{oJi) < Pmax for 
all LJi. Then we have that 



\K. 



p,CiJ,N\ 



< 



pi rifc^i 



^ /[0,2 



sin(Af(x(,(fc_i)-a;b(fc))/2) 
sin((i:b(fc)-2;m. + i))/2) 



dx, 
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where we have converted to Lebesgue measure, and where we on the form r 



have also written dx = dxi ■ ■ ■ dx^p^. Consider first the set 



|a;|i-("-2). 

( |93] ) is bounded by 



for some constant r f36l. Therefore, 



U = {uj\\xb{k-i) - Xb(k) I < TrVfc}. 



When 22: < 



TV - Pfc(fc-l) ^ ^b{k) 

sin [N{xb(k-i) - a;b(fc))/2 



< TT, it is clear that 

4 



|.|>1 



dx 



1 



, , |^|2-(n-2). 
\x\>l Fl ^ ' 

2r 



■dx 



sm 



< 



Xb(k-l) — Xb{k) 



(90) 



since |sin (A^(a;b(fe-i) — a;b(fe))/2)| < 1, and since |sin(a;)| > 
If I when < f . When |a;fc(fc_i 



Xb{k) I < 1^ we have that 



{N{xbi 



k-l) 



Xb(k))/'2) 



sm 



{{xb(k-i) - Xb(k))/2) 



< N. 



(91) 



Let ki, fc|p| G Z, and assume that A;|p| = 0. By using the 
triangle inequality, it is clear that on the set 



(n-2)e- 1 

This proves that the entire sum (|93] ) is bounded, and thus 
also the statement on the existence of the limit K{p,uj) in 
Theorem [3] when the density is continuous. 

For the rest of the proof of Theorem [3], we first record the 
following result: 

Lemma 2: For any e > 0, 

F{uj)duj^O, (95) 



lim , , , 



Dk,, 



N 



where 



< ^Vl < z < |p|}, 



when \kr — ks\ > 2 for all r,s, the i'th factor in F{x) is 



bounded by 



4JV 



due to ( [90b . Also, when \kr- 



kg\ < 2 for some r, s, the corresponding factors in F{x) are 
bounded by N on Dki^...^k\p\ due to ( |9T| i. Note also that the 
volume of Dki....,k^^^_^ is {2-k)\p\-^ N^-\p\ . By adding some 
more terms (to compensate for the different behaviour for \kr — 
A:^ I > 2 and \ kr — ks\ < 2), we have that we can find a constant 
D that 



J^J^\F{x)\dx 

1 j^n 



^ y~]n</gi ;^|p|-i<w 

all ki different 

= (27r)l''li?"Eo<.i 



(n"=i 



,fc|p|-l<N 

all ki different 



) 2^(2^)l''l-i7Vi-l''l 



lr=l \k. 



(92) 

where we have integrated w.rt. x^p^ also (i.e. /c|p| is kept 
constant in (|92]i). A similar analysis as for J7 applies for the 
complement set 

V = {uj\n < \xb(k-i) - Xb(k} \ < 27r for some k}, 
so that we can find a constant C such that 



Proof: The set B^ r corresponds to those ki, fc|p| in (|93] l 
for which |fcb(r_i) — fcf,(r)| > Thus, for large N, we sum 
over fci, fc|p| in ( |93] l for which |fcf,(r-i) — ^fc(r)| is arbitrarily 
large. By the convergence of the Fourier integral of j^, it is 
clear that this converges to zero. g 

Define 

B, = {(wi, ...,uj\p\)\\uJi - Ljj\ > e for some i,j}. 

If w S B^, there must exist an r so that \u!b^r-i)^'^b{r) I > 7^ > 
so that (jj G B^ 2€/n- This means that 

B^ C Uri?r,2e/n7 

SO that by Lemma |2] also 
1 



lim , , II 



F{uj)duj = 0. 



N^+^-\p\ /[0,27r)lpl \F{x)\dx 

— C' X]o<fci,.^.,fc|p|-l<iV nr=l Jk^ 

all ki different 



1— 



(93) 



It is clear this sum converges: First of all, this is only needed 
to prove for p — On, since the summands for p ^ 0„ is only 
a subset of the summands for p = 0„. 

Secondly, for p = 0„, (|93T l can be bounded by considering 
convolutions of the following function with itself: 



(/[c 
dx\. 



fix 







for \x\ > 1 
for \x\ < 1 



(94) 



The assumption that f{x) = in a neighbourhood of zero is 
due to the fact that the ki are all different. Note that |/(x)| < 
jjp-TT for any < e < L Also, the n — 2-fold convolution 
(we wait with the rt — I'th convolution till the end) of , 
with itself exist outside whenever < (n — 2)e < 1, and is which is what we had to show. 



This means that in the integral for Kp^^^N, we need only 
integrate over the uj which are arbitrarily close to the diagonal, 
(where uji — ■ ■ ■ — u!\p\)- We thus have 

Kp^^ = limjv^oo jv+^i-ipi /[o,27r)ipi ^(^) UltiPUxr)dx 
= liniAT^oo jvn+\-ip| j\o,2^)M F{x)p^{xipiyp\dx 

= limAr_^oo jvn+\- | pi lo'' P'^i^lpiy''^ 

f[o,27r)ipi-i-^(^)'^2;i---da;|p|_i) 

IpI- 

We used here that the density is continuous. Using that 

limAT^oo jV'.+^i-IPl /[0,27r)IPl-i Fix)dxi ■ ■ ■ dX|p|_i 

= (27r)IPl-iA'p,„ ^^^^ 

when x\p\ is kept fixed at an arbitrary value (this is straight- 
forward by using the methods from the proof of Proposition [T] 
and ([89]l), we get that the above equals 

ifp,„(2^)l''l'i / pUxipiy^^dxyi, 
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Appendix I 
The proof of Proposition[6] 

Proposition |6] will follow directly if we can prove the 
following result: 

Lemma 3: Let ojk (1 < k < n) he the uniform distribution 
on [M^, 22tfe] and define c^Ai,...a„ (0 < A, < 1, Ai + • • • + 
A„ ~ 1) as the phase distribution with density p^^^ — 
AiPc^i H 1- XnPui^- Then 

Proof: This follows immediately by noting that 



P,'^Ai,...,A„ 



X / (Aip^,(x) + --- + A„p^„(x))l''lda; 

JQ 



((Ai) 



-(A, 



\p\ 



2lT 



((Ai)!"! / \^Ax)\p^dx + -- 



+ (A„)I''I / P^A^yP^dx) 
Jo 

ifp,„(2^)l''l-i((Ai)l''l+... + (A„)l''l) 



<.27r 

X / p^-^{x)dx 

JO 



if, 



where we have used that a;^''' + • • • xlf ' constrained to xi + 
• • • + a;„ = 1 achieves its minimum for xi = ■ ■ ■ = Xn = -■ 



Appendix J 
The proof of Theorem|4] 

The contribution in the integral Kp^^^M comes only from 
when the oji coincide with the atoms of p. Actually, we 
evaluate 



in points on the form lo ~ ai — aj. This 



evaluates to N^p^ when all uji are chosen equal to the same 

T j N UJ 

atom aj. Since lim^v^^oo jv(i-eJ") ~ ^ ™y fixed uj ^ Q, 
limAT^oo Kp.uj.NN'^'^ — when oj is chosen from nonequal 



atoms. ( |52] i (with additional l/A^-factors) thus becomes 
giving rise to p 

]Sf\p\-2n-l^\p\-lj^-\p\ 

iE^N-P?+ap.NN^)) 
Di(iV)(ji,Ji)D2(7V)(j2,i2) 
•••X D„(iV)(j„,j„), 

where limjv^oo ap,Af — 0. Multiplying both sides with N and 
letting N go to infinity gives 



(97) 



^lim N\p\-^cM-' 



_ ^p'l + ap^N ) iip. 

pey(n) 

It is clear that this converges to when p 7^ 0„ (since |p| < n 
in this case), so that the limit is 



which proves the claim 



Appendix K 
The proof of TheoremO 

We need the following identity ll36ll : 

where sgn{x) = 1 if a; > 0, sgn{x) = — 1 if a; < 0, and 
otherwise. From this it follows that 

S-ooPi\^ ~ a.; I 6-'"'= da; = 



2p.e--m^cos(ii^) 



(98) 



Note that the measure with density p, has the same asymptotics 
near ai as the measure with density pi\x — ai\~^ on 



1 - s 



1-s 



As in the proof in Appendix |j] the integral for the expansion 
coefficients is dominated by the behaviour near the points 
(ai,...,ai). To see this, note that the behaviour near the 
singular points on the diagonal is O {s{\p\ — n) — 1) when 
polynomic growth of order s of the density near the singular 
points is assumed. This is very much related to ( |93] ) in 
Appendix IhI since Kp^^ here in a similar way can be bounded 
by (taking into account new powers of N) 



C 



1 



JVn + "s + l-|p 



X X] o<fci,.-.,fcipi<N nr=i 



-kb(T)\ 



(99) 



all ki different 

In ( l99b . the A^"-factor appears in exactly the same way as 
in the proof of Theorem [3] in Appendix iHl N^^p^ appears as 
a volume in R'''', and N^p\'' comes from evaluation of the 
density in the points Xi — 1 < « < |p|)- Since has a 
bounded integral around 0, and since the sum still converges 
(it is dominated by (|93]l), (|99]l is 

0{s{\p\-n)-l). 
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This has it's highest order when \p\ ~ n, so that we can restrict 
to looking at 0„. Note also that we may just as well assume 
that Plj{x) is identical to — Wij"* at an interval around 
u!i, since lima;^ct. \x — ai\''Pi^{x) = Pi implies that 



Plo{x) ^ Pi\x - Uil ^ + k{x)\x - UJi\' 



(100) 



where lima;^^^; k{x) = 0. It is straightforward to see that the 
contribution of the second part in dlOOb to ( |99] l vanishes as 
N oo, so that we may just as well assume that Puj{x) is 
identical to pi\x — uji\^'' at an interval around Ui, as claimed. 
Also, since 

lim / a;-"e^""=dx = 

for all e > 0, and since the contributions from large n dominate 
in ( lion below (since " diverges), it is clear that we 

can restrict to an interval around uji when computing the limit 
also (since p^ is continuous outside the singularity points, this 
follows from Theorem |3] and due to the additional -^-factor 
added to ([T)). After restricting to 0„, multiplying both sides 
with N, summing over all singularity points, and using 
we obtain the approximation 



E E 



X ( 2paT{l — s) COS 

Qj{ik-l—ik)aa. 



(1 - s)7r 



n 



to 



xtrL(DiiN)) X • • • X trL(Dn{N)) (101) 
Since Ilfc^i e^^''=-i~''='"" = 1, we recognize 

= (2r(l-s)cOs((i^))"(EaP'a)x 



as a factor in ( llOlb such that the limit of ( IIOII ) as oo 
can be written 

n 

c"-i lim lim T] trL (TtJN)) . 



i=l 



It therefore suffices to prove that limjv^oo 'z'-"'^-' — 'Z^"''- To 
see this, write 



N- 



1 



1 



\^k-l - ^k\ 



:1-S 



1 



1 



N 



N N 



Summing over all 1 < ii, .... z„ < A^, it is clear from this that 
qin,N) ^^jj viewed as a Riemann sum which converges to 

qin) as A^ -> oo. 



Appendix L 
The proof of Theorem[2]and Corollary[T] 

Proof of Theorem 13 we define Sj to be the blocks of a. 



I.e. 



Sj = {k\ik =j}. 
Note that Theorem |3] guarantees that the limit Kp,^ 



N- 



. Kp,i^,N exists. The partition p simply is a grouping 



lim 

of random variables into independent groups. It is therefore 
impossible for a block in p to contain elements from both 
and S2, so that any block is contained in either Si or 82- As 
a consequence, p < a. g 
Until now, we have not treated mixed moments of the form 

T>,{N)Y,,YZT>2{N)Y,,YZ • • • X D„(7V)V,,V,^, 

which are the same as the mixed moments of Theorem [T] 
except for the position of the 'Di{N). We will not go into 
depths on this, but only remark that this case can be treated 
in the same vein as generalized Vandermonde matrices by 
replacing the density pj (or p\ in case of continuous gener- 
alized Vandermonde matrices) with functions po.{x) defined 
by PDi{x) = D,,(A^)([La;J, [Lx\) for < a; < 1. This also 
covers the case of mixed moments of independent, generalized 
Vandermonde matrices (and, in fact, there are no restrictions 
on the horizontal and vertical phase densities p^. and px. for 
each matrix. They may all be different). The proof for this is 
straightforward. 

Proof of Corollary [TJ this follows in the same way as 
Proposition |3] is proved from Proposition |2] by only consid- 
ering p which are less than a, and also by using Theorem [3] 
cr are for the listed moments {{!}, {2}}, {{1, 3}, {2, 4}}, and 
{{1,3, 5}, {2, 4, 6}}, respectively. . 



Appendix M 
The proofs of Proposition[7]and[8] 

The moments E [tr„ (W*)] will be related to the moments 
Pi through three convolution stages: 

1) relating the moments of W with the moments of 

1 



r = VP5 ( -^ss^ I p2v^ 



from which we easily get the moments of 
1 



(102) 



S = ( -^SS^ ) P3V^VP4, (103) 



2) relating the moments of S with the moments of 

T = PV^V, (104) 

3) relating the moments of T with the moments of P. 
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For the first stage, the moments of W and F relate through 
the formulas 



where cg ^ ^, and T = PV^V. (fT05Tl-(fT07]i. (fT08Tl-(fT09]l, 
and Jl 10| i-( fTl3l ) can be combined to 



E [ir„ (W)] 
E [ir„ (W2)] 



E [trn (W^)] = 



E[trNiT)]+a^ (105) 

E [trN (r^)] 

+2a^{l + ci)E[trN (F)] 

W(l + ci) (106) 

E [trN (r^)] 

+3(7^1 + ci)E [trN (r^)] 
+3cr^CiE 



E [trn (W)] 
E [tr„ (W2)] 

E [trn (W^)] 



C2E [trL (T)] + (j^ 
C2£; [ir-L (t2)1 + C2C3S 



(114) 



(^r-L (T))^ 



-2a\c2 + C3)E [trL (T)] +^4(1 + c015) 



= C2 (^1 

+SC2C3E [{trL (T)) (trL (T^))] 



-C24E 



(tTL {T)Y 



+3^4 f c? 



-a*^ f c? + 3ci 



(i^A^ (r)) 

|-3ci + 
1 



C1C2 



(l + ci)c2 + ^)i?[tr, (T^)] 



+3a^C3{c3 + 2c2)E (ir^ (T)) 



i + (r)] 



1 



Sci + l + ^]c2E [trL (T)] 



(107) 



3C1 + 1 + — 



(116) 



which are obtained by replacing R in |[33l by VP 2 S, with 
c — ci = ^. For the second part of the first stage, note that 



Up to now, all formulas have provided exact expressions for 
the expectations. For the next step, exact expressions for the 
expectations are only known when the phase distributions are 
uniform, in which case the formulas are given by Theorem |2l 



C2E[trL{T)] = 
C2E [trL (T^)] = 

C2E [trL (T^)] = 



E[trN{T'')] = C2E 



E 



4E 



trL [S" 



(108) 
(109) 



E 
E 



(tTL my 

(trL (T))= 



C2trL{P) (117) 
{1 ~ N-^) C2trL{P^) 
+cl{trL{P)f (118) 
{I - 3N-^ + 2N-^) C2trL{P^) 
+3(l-iV-i) cltrL{P)trL{P^) 
+cl{trL{P)f (119) 
trLiPf (120) 

trLiPf (121) 



(122) 



where C2 = ^. We can now apply Theorem |2] to obtain 



E[{trL iT)){trL (T^))] 
- (1 - N-^) tri(P)tri(p2) + c2itrLiP)f. 

If the phase distribution uj is not uniform. Theorem [T] and 
Theorem [3] gives the following approximation: 



C3E 
C3E 

C3E 



trL (s) 
trL (S^) 

trL (S^) 



= C3E[trL{T)] 
= C3E[trL{T^)] 

'{tTL mf 



(110) 



C2E [trL (T)] 
C2E [trL (T^)] 
C2E [trL (T^)] 



~clE 



(111) 



E 
E 



{l + K-^)c3E [trL (t3)] 
+3clE[{trLT) trL (T^)] 



(trL my 

{trL m)f 



C2trL{P) (123) 
C2trL{P^) + clhitrLiPyf (124) 
C2trL{P^) + 3cll2trL{P)trL{P^) 
+cll3{trL{P)f (125) 
{trLPf (126) 

{trLPf 



-4E 



{trL {mr 



E[{trL {T)){trL (T^))] 
«iri(P)iri(p2)+c2/2(tri(P))3, 



(127) 
(128) 



(112) 



E 



trL is 



E 



{trL {T)y 

^ E[trL{T^)], (113) 



KL 



where the approximation is 0{N ^), and where Ik is defined 
by dull. 

Proposition [8] is proved by combining (II 14b - (ll 16t with 
(II 17| |- (I122| |. while Proposition |7] is proved by combining 
(fTT4] i- (fTT6] l with (fT23T l- (fT28] l. Proposition |9] is proved by first 
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observing that the roles of L and N are interchanged, since 
the Vandermonde matrix is replaced by its transpose. This 
means that we obtain the formulas ( lll4l )-( fTT6l l. with ci and 
C3 interchanged, and C2 replaced with ^. The matrix T is 
now instead VV^, and these can be scaled to obtain the 
moments of V^V. Finally the integrals /„ or the angle a 
can be estimated from these moments, using (ll23b -( fT28T l with 
the moments of P replaced with 1 (since no additional power 
matrix is included in the model). 

Matlab code for implementing the steps ( I105b - (I107| |. ( IllOb - 
(fml) . and (fTT7Ti-(fT22l) can be found in ||34| . 
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